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RANDOM  PACKING  AND  RANDOM  COVERING  SEQUENCES 
Clifton  Dickerson  Sutton 


In  a  sequential  packing  problem,  random  objects  are  uniformly  and  independently  selected 
from  some  space.  A  selected  object  is  either  packed  or  rejected,  depending  on  the  distance 
between  it  and  the  nearest  object  which  has  been  previously  packed.  A  saturated  packing 
is  said  to  exist  when  it  is  no  longer  possible  to  pack  any  additional  selections.  The  random 
packing  density  is  the  average  proportion  of  the  space  which  is  occupied  by  the  packed  objects 
at  saturation. 

Results  concerning  the  time  of  the  first  rejection  in  a  packing  sequence  are  given  in 
Chapter  1.  The  accuracy  of  some  common  approximation  formulas  is  investigated  for  several 
settings.  The  problems  considered  may  be  thought  of  as  generalizations  of  the  classical  birthday 
problem. 

Exact  results  concerning  random  packing  densities  are  generally  known  only  for  some 
packing  sequences  in  one- dimensional  spaces.  In  Chapter  2,  the  packing  densities  of  various 
computer  generated  codes  are  examined.  These  stochastically  constructed  codes  provide  a  con¬ 
venient  way  to  study  packing  in  multidimensional  spaces.  Asymptotic  approximation  formulas 
are  given  for  the  packing  densities  which  arise  from  several  different  coding  schemes.  In  one 
special  case  considered,  a  new  method  is  found  for  approximating  a  planar  density.  The  result  y 
obtained  agrees  closely  with  estimates  obtained  by  others. 

In  Chapter  3  the  distribution  of  the  number  of  random  selections  needed  to  achieve  a 
saturated  packing  is  considered.  In  each  of  the  settings  examined,  the  results  are  compared 
with  analogous  results  from  an  associated  random  covering  problem. 
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Chapter  0 


Introduction 


Sequential  packing  and  covering  problems  have  been  investigated  by  numerous  au¬ 
thors;  however,  there  are  quite  a  few  interesting  questions  which  have  remained  unan¬ 
swered.  Here  some  stopping  rules  arising  in  various  packing  and  covering  sequences  are 
examined,  and  some  fresh  results  concerning  their  distributions  are  presented. 

The  basic  packing  problem  can  be  briefly  described  as  follows.  Random  objects  are 
sequentially  selected  from  some  space.  A  selected  object  is  considered  packed  if  it  does 
not  overlap  any  of  the  previously  packed  objects,  otherwise  the  object  is  rejected.  The 
selection  process  continues  until  it  is  no  longer  possible  to  pack  any  additional  objects.  The 
chief  problem  is  to  determine  how  much  of  the  space  is  covered  by  the  terminal  collection 
of  packed  objects.  So  far.  explicit  solutions  have  been  obtained  only  for  packing  sequences 
on  one-dimensional  spaces. 

Chapter  1  examines  the  time  of  the  first  rejection  in  packing  sequences.  The  prob¬ 
lems  considered,  which  will  be  called  collision  problems,  may  be  viewed  as  generalizations 
of  a  familiar  birthday  problem  where  people  are  sequentially  sampled  at  random  until  a 
matching  of  birthdays  occurs.  Applying  the  usual  simplifying  assumptions,  this  simple 
birthday  setting  may  be  modeled  as  follows.  A  sequence  Cj,  C2,  •  •  •  of  independent  selec¬ 
tions  are  made  from  the  space  5  =  {1,2,  •••,«},  where  n  represents  the  number  of  days 
in  a  year.  The  selections  are  made  according  to  a  uniform  distribution,  i.e. 


P{C,  =  k)  =  n"1 


for  1  <  k  <  n  and  for  each  t.  Letting  r  count  the  number  of  selections  needed  to  obtain 
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the  first  duplication  of  outcomes,  it  is  well  known  that  for  fixed  t  >  0 


A  new  result  established  in  Chapter  1  yields,  for  this  simple  birthday  setting,  that 

(0.3)  P{r  >  |naJ}  ~  exp  as  n  —  oo 

for  0  <  q  <  2/3.  In  other  words  (0.3)  says  that  the  common  approximation  formula  (see 
[14]) 

(0.4)  P{t  >  k)  =  exp(— k2 /2n), 

which  is  suggested  by  (0.1),  will  hold  reasonably  well  whenever  k  <  n 2^3  and  n  is  suffi¬ 
ciently  large.  A  direct  calculation  in  a  related  collision  problem  suggests  that  the  equiva¬ 
lence  stated  in  (0.3)  does  not  hold  for  values  of  a  which  are  greater  than  or  equal  to  the 
upper  bound  2/3. 

Other  settings  for  collision  problems  are  also  examined  in  Chapter  1.  For  the  one¬ 
dimensional  cases  of  packing  arcs  of  equal  length  on  the  discrete  and  continuous  circles, 
with  a  collision  occurring  if  two  arcs  overlap,  results  similar  to  (0.1),  (0.2),  and  (0.3)  are 
obtained.  However,  for  collision  problems  in  higher  dimensional  spaces  only  analogs  of 
(0.1)  are  proved  exactly,  with  plausible  arguments  and  simulation  results  used  to  provide 
support  for  the  approximation 

(0.5)  £M  a  ^ 

which  seems  to  hold  in  a  large  number  of  cases.  In  (0.5),  p  is  used  to  denote  the  probability 
that  two  arbitrary  random  selections  collide. 

The  time  to  the  first  collision  is  also  investigated  for  several  variations  where  arcs 
of  unequal  length  are  packed  on  the  circle.  These  collision  settings  do  not  seem  to  have 


Chapter  O'  Introduction  3 


been  previously  considered  by  others.  For  cases  where  the  arc  lengths  are  i.i.d.  random 
variables,  upper  and  lower  bounds  are  obtained  for  P{t  >  k).  and  results  are  given  which 
indicate  that  the  approximation  formula  (0.5)  is  not  applicable  for  these  settings. 

Random  packing  densities  for  various  random  coding  schemes  are  investigated  in 
Chapter  2.  Stochastically  constructed  codes  provide  a  convenient  way  to  study  random 
packing  in  high-dimensional  spaces,  and  a  few  special  cases  yield  limiting  values  for  packing 
densities  which  may  be  compared  with  analogous  values  obtained  by  others. 

Itoh  and  Solomon  [27]  have  studied  the  densities  for  cases  where  binary  codewords  are 
randomly  packed  by  Hamming  distance.  They  obtained  approximation  formulas  for  the 
densities  in  some  of  the  cases  which  they  investigated.  In  Section  2.2  a  general  approxima¬ 
tion  formula  is  proposed  which  not  only  does  better  than  the  approximations  given  in  [27], 
it  also  does  reasonably  well  for  several  additional  cases.  Letting  n  denote  the  total  number 
of  possible  codewords  and  p  be  the  probability  that  two  arbitrary  codewords  collide,  it  is 
found  that  the  random  packing  densities  appear  to  be  asymptotically  approximated  by  a 
function  of  np  which  contains  only  one  empirical  constant. 

Other  sections  in  Chapter  2  examine  random  q- ary  codes  packed  using  various  met¬ 
rics.  For  the  packing  scheme  discussed  in  Section  2.5,  a  two-dimensional  analog  of  a 
one-dimensional  result  of  Mackenzie  [36]  is  obtained.  This  new  formula  yields  an  approxi¬ 
mation  for  a  planar  packing  density  which  is  in  close  agreement  with  estimates  determined 
in  previous  studies. 

The  total  number  of  random  selections  required  to  achieve  a  saturated  packing  is 
examined  in  the  final  chapter.  Chapter  3  also  considers  the  total  number  of  selections 
required  to  completely  cover  the  space.  In  this  variation,  the  random  objects  are  allowed 
to  overlap  and  none  of  the  selections  are  rejected,  and  it  is  of  interest  to  determine  how 
many  selections  are  required  until  each  point  of  the  space  is  covered  by  at  least  one  object. 
A  few  such  covering  results  have  been  found  by  Flatto  and  several  others;  however,  the 
time  to  saturation  does  not  seem  to  have  been  previously  considered.  It  is  found  that  the 
average  number  of  selections  required  for  saturation  exceeds  the  average  number  required 
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for  coverage  in  some  settings,  while  the  opposite  is  true  in  other  cases.  One  easily  proved, 
yet  somewhat  surprising  result,  is  that  for  some  packing  sequences  on  continuous  spaces 
the  expected  number  of  selections  required  for  saturation  is  infinite. 


V.v.v.v 
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Chapter  1 


The  Time  to  the  Initial  Collision 
of  a  Packing  Sequence 


1.1.  Introduction 

Let  5  be  either  a  finite  space,  a  fc-dimensional  unit  torus  ( k  €  {1,2.  •••}),  or  the 
surface  of  a  sphere  in  Ek  (k  €  {2, 3,  •••}),  and  suppose  that  points  C\  6  S,  C2  €  S,  •  •  •  are 
independently  selected  according  to  a  uniform  distribution.  Thus  if  5  is  the  finite  space 
{sj,s2.  •  ■  -  then 

P{Ct  =  s,}  =  n-1 

for  i  =  1. 2.  •  •  •  and  1  <  j  <  n.  If  5  is  infinite  then 

P{Ci  €  B)  =  |||  (**1,2,...), 

where  |  •  |  denotes  Lebesgue  measure  and  B  is  any  measurable  set. 

Let  p  be  a  metric  on  5.  Then  for  some  specified  <5  >  0,  two  points  C,  and  Cj  are  said 
to  collide  provided  that  p(C,,C;)  <  6.  If  this  condition  is  not  met,  then  the  points  are 
said  to  be  disjoint.  A  set  of  three  or  more  points  is  said  to  be  disjoint,  or  fairly  packed,  if 
the  points  are  pairwise  disjoint.  In  the  literature,  a  collision  is  sometimes  called  a  match 
or  a  coincidence. 

Let  C,  A  Cj  denote  the  event  that  C,  and  C}  collide.  If  C,  and  C;  are  disjoint,  write 
C,  vCj.  Denoting  (J^_,{Ct}  by  C(j).  write  C*  aC(;')  if  C*  collides  with  at  least  one  member 
of  C(j).  Let  p  denote  the  probability  that  two  arbitrary  points  collide,  i.e.  p  =  P{C,  A C;} 
(?  ^  j).  Note  that  p  will  always  be  positive. 
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Consider  Ci .  C2.  •  •  •  chosen  independently  and  uniformly  from  S.  5,  p.  P,  £  and 
{C,}^  will  collectively  be  called  a  packing  sequence.  Define  a  stopping  time  r  by 

{r  =  &}  =  {#C(k  -  1)  =  k  -  1,  C(k  -  1)  is  disjoint,  Ck  A  C(k  -  1)}. 

Thus  7  equals  k  if  and  only  if  Ck  is  involved  in  the  first  collision  that  occurs  as  the  points 
Ci,C2,---  are  selected  one  after  another.  Furthermore,  if  r  exceeds  k ,  then  the  first  k 
points  of  the  sequence  are  pairwise  disjoint.  Using  the  terminology  introduced  above,  it 
can  also  be  said  that  the  first  k  points  have  been  fairly  packed  whenever  r  exceeds  k.  For 
this  reason  r  is  called  the  time  to  the  initial  collision  of  a  packing  sequence. 

For  certain  choices  of  S.  p,  and  6,  r  is  a  random  variable  for  which  some  results  are 
known.  For  example,  several  varieties  of  what  are  commonly  called  birthday  problems  can 
be  treated  by  suitably  defining  5,  /x,  and  6.  In  Sections  1.3,  1.4,  1.7, and  l.S  which  follow, 
some  known  results  concerning  r  will  be  reviewed  and  some  extensions  and  new  results  will 
be  obtained,  for  some  particular  metric  spaces.  Simple  applications  will  be  mentioned, 
and  some  examples  will  be  presented.  Sections  1.5  and  1.6  will  examine  some  related 
problems  for  which  not  all  of  the  criteria  needed  for  a  packing  sequence  as  defined  here 
are  met.  However  before  proceeding  with  any  specific  packing  sequence,  it  is  convenient 
to  derive  first  some  results  which  are  true  for  a  large  class  of  choices  of  (S,  p)  and  f>. 

1.2.  Sequences  of  Equivalent  Points 

A  packing  sequence  will  be  said  to  consist  of  equivalent  points  if  the  probability  that 
a  randomly  selected  point  collides  with  any  given  element  of  the  space  5  is  equal  to  the 
probability  that  two  arbitrary  points  collide.  That  is,  the  sequence  has  equivalent  points 
if  for  each  i  and  for  every  j  6  S 

P{C,  As}  =  P{p(C„s)<  6}  =p. 

It  will  be  seen  in  the  next  section  that  the  packing  sequence  associated  with  the 
classical  birthday  problem  consists  of  equivalent  points.  So  do  the  remaining  packing 
sequences  of  Section  1.3,  and  also  those  discussed  in  Sections  1.4  and  1.7.  The  first 
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packing  sequence  discussed  in  Section  1.8  consists  of  equivalent  points;  however,  it  will 
be  shown  that  t he  alternative  packing  sequence  for  ternary  n-tuples  found  in  that  section 
does  not. 

As  a  simple  example  of  a  packing  sequence  which  does  not  consist  of  equivalent 
points,  consider  the  one  having  S  =  { 1. 2.  3. 4. 5. 6, 7}.  n(x,y)  =  |x  -  y\.  and  S  =  3.  Here 
P{C ,  a  1}  =  P{C ,  A  7}  =  = ,  P{C,  A  2}  =  P{Ct  A  6}  =  |.  and  P{C ,  A  3}  =  P{Ct  A  4}  = 
P{C\  A  5}  =  4.  Since 


’■Hi  * 


2  /4\  3  / 5\  29 


it  is  clear  that  the  sequence  does  not  possess  the  equivalent  points  property  that  P{C,  A 
s)  =  p  for  every  s  €  S.  In  both  this  example  and  the  non-equivalent  points  case  of  Section 
1.8.  it  can  be  noted  that  the  metric  p  creates  a  boundary  effect  on  the  space  S.  If  the 


metric  above  is  changed  to 


p(r.  y)  = 


|z  -  y|  if  \x  -  y\  <  3 
7  -  |t  —  y\  otherwise 


in  order  to  impose  a  torus  quality  on  the  points  of  5.  then  the  packing  sequence  now- 
consists  of  equivalent  points.  This  is  because  for  each  s  E  5 

P{C,  A  5}  =  p  =  y. 

The  results  presented  in  this  section  hold  for  any  6  >  0  and  metric  space  (5,  yi)  which 
provide  a  packing  sequence  of  equivalent  points.  The  corresponding  uniform  probability 
measure  P  will  always  be  as  specified  in  Section  1.1. 

A  lower  bound  for  P{t  >  fc}  is  easy  to  establish. 

Proposition  1.1.  For  k  >  2, 


l  0  otherwise 
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Proof.  Using  Boole's  inequality 

P{r  =  k\  r>k-l}  =  P{Ci  A  C*  u  Cj  A  C*  u  ■  •  •  u  C*_i  A  C*  |  r  >  k  -  1} 

<  ACk  |r  >  *-  1} 

;=i 

*-i 

=  ^P{CJAC*} 
j=i 

=  (*-  1)P- 

Thus 

P{r  >  k)  =  P{r  >  k  -  1}  -  P{t  =  *} 

=  (1  -  P{t  =  it  |r  >  Jt-  l})P{r  >  k-  1} 

>  (1  -(*-  l)p)P{r  >  *-  1}. 

For  p  <  (k  -  1)_1.  simple  induction  yields 

P{r  >  k)  l)p)(l-  (*-2)p)  -  (l  -p)P{r  >  1}. 

Noting  that  P{t  >  1}  =  1  completes  the  proof.  | 

It  can  be  seen  b>  examining  (1.11)  of  the  next  section  that  this  lower  bound  is 
sharp  for  the  sequence  of  the  simple  birthday  problem.  Thus,  on  the  average,  the  waiting 
time  for  a  coincidence  will  tend  to  be  at  least  as  long  for  any  other  packing  sequence 
consisting  of  equivalent  points  and  having  the  same  value  for  p  (equal  to  n_I  )  as  it  is  for 
the  corresponding  simple  birthday  sequence. 

Since  (1  -  jp)  becomes  cumbersome  to  compute  for  large  k,  it  i<  desirable  to 
establish  a  more  convenient  lower  bound.  Lemma  1.1  gives  a  lower  bound  which  can  be 
easily  used  to  establish  asymptotic  results  concerning  the  distribution  of  r. 

Lemma  1.1.  For  any  real  numbers  n  and  o  satisfying  n  >  3  and  a  <  and  any 
integer  k  such  that  2  <  k  <  n, 

-kn-°)  if  k  <  n° 

0  otherwise 
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Proof.  The  case  for  k  >  n°  is  trivial.  For  k  <  observe  that 

k-\ 


fi(i-L). _ nzi 

**  V  n  )  nk~l(n  -  k) 


k)T(n  -  k) 

Judicious  application  of  the  double  inequality 

(1.1)  \/2*xT~,/Vr  <  T(i)  <  V2irxr~1/7e~T+Th 

to  the  gamma  functions  above  yields 
k-\ 


nH)>K)' 


Hence  it  suffices  to  show 

-n+k-l/2 


0-5)' 


-i 


or  equivalently 


Since  the  expansion 


-l 


l 

exp 

(  k  1 

V  12(n  —  1:) 

(# 

jt  1  ^ 

V2n 

12(n  —  k) ) 

k  > 

1  +  2n  1 

b6(rh)=,+i'I  +  5'J+ 


yields  the  inequality 
(1.3)  log(l  -  t)~l  >  V  -fm  (0  <  /  <  1,  M>  1), 


M 


m=l 


it  follows  that  the  left  side  of  (1.2)  exceeds 


(1.4) 


,Jk  1  ( k\7\  k  k2  ,  1 

'""‘'U+jUj  +^  +  2j‘‘'T2(^ 


k  k 3 


1 


n°  2n2  12(n  -  k)' 

Thus  it  suffices  to  show  that  (1.4)  is  greater  than  zero,  or  equivalently  that 

k2 


12( 


n°  V  2 n2~°  J 


(1.5) 
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But  this  is  indeed  the  case  since  the  left  side  of  (1.5)  exceeds 


O' ^ 

=  24(n1-°  -  1)  (l  - 

>  24(n^3  -  1)  ^1  -  -—■'j 

>  12(31/3-  1) 

>1.  I 


The  corollary  below,  follows  immediately  from  Proposition  1.1  and  Lemma  1.1. 

Corollary.  For  a  real  and  k  an  integer,  if  p.  a  and  k  satisfy  p  <  5.  a  <  §  and  k  >  2. 

then 


P{t  >  k) 


>  e'fyi  -  kpa)  if  k<p~a 

>  0  otherwise. 


An  alternative  lower  bound  for  ^"^(l  —  jp)  can  be  obtained  by  a  different  technique. 
Although  this  lower  bound  is  tighter  than  the  one  found  above,  note  that  the  nontrivial 
portion  still  requires  that  k  does  not  exceed  n2^3. 


Lemma  1.2.  For  any  n  >  3  and  any  integer  k  >  2, 


< 


>  e 


1  - 


l  >0 


if  k  <  n2'3 
otherwise. 


Proof.  Since 


log(l  -t)>  -t  -t2 

for  0  <  t  <  and  since  £  <  |  for  1  <  j  <  k~  1  whenever  the  conditions  of  the  hypothesis 


WA’MVk'k'  H.W  1W  VW  V.  TOTCTOramnTOgOT  rjtf  V  V  TP  *3  '1'*'  v  g»  i>  rt  '.U  VW  <''’VT".  \  V1 \  ’ 
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are  met  and  k  <  n2/3,  it  follows  that  if  k  <  n2/3  then 


n(1-:)'“p{§lo,(1_0} 

f  (k-l)k  _  ( k  -  l)k(2k-  1)1 
\  2n  6n2  J 


=  exp 
>  exp 


!-tl  _  ill 

\  2 n  3n2  J 


Then  since  e"‘  >  1  -  t  for  all  t  <  3  and  since  it  follows  from  above  that  if 

k  <  n2/3  then 


Corollary.  For  any  p  <  \  and  any  integer  k  >  2, 


P{ 


l>o 


otherwise. 

It  is  also  possible  to  establish  an  upper  bound  for  P{r  >  A:}. 
Proposition  1.2.  Let  Af  be  the  largest  integer  m  satisfying 
(2m2-5m+l)  (m  -  l)5  2 


-p  + 


24 


<  I- 


Then 


P{t  >  k)  < 


t~  tirrLp  for  2  <  k  <  M 


( M-ll 3  _ 

e~  2  p  otherwise. 


Proof.  Since  P{CT  A  C,,Ct  A  Cu}  =  p 2  if  the  set  {r,s}  is  not  identical  to  the  set 
{t,u},  it  follows  by  a  well-known  inclusion-exclusion  argument  that  for  k  >  3, 


P{t  >  k)  =  P  ^  pi  C,  v  Cj 
1 !<■<;<* 

<  1  -  ^P{Cr  A  C,}  +  f{CrAC„C,ACu} 

r<»,t<u,s<u 


r<s 


a 


s 

SCI 


a 


a 


I 


A'j 

$ 


SI 


&*■ 


5y 

& 


II 
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(1.6) 


=  1 


<  *  ~  i 

Now  truncating  the  Taylor  expansion  of  e  r* yields 


(> 

(1.7)  e~  7 


The  condition  that  (1.6)  is  less  than  or  equal  to  the  right  side  of  (1.7)  simplifies  to 


l.S) 


(2F-5t+1)  +(k  -!)%,<! 
4  24 


Hence  it  follows  that 


P{r  >k)  <e' 


whenever  (1.8)  is  true  and  3  <  k  <  1  +  Over  k  >  2,  the  left  side  of  (1.8)  increases 

with  k  and  is  not  bounded  from  above,  so  that  the  existence  of  a  unique  M  is  guaranteed. 
Furthermore,  whatever  p  is,  2  <  M  <  1  +  The  proposed  upper  bound  has  now  been 
established  for  3  <  k  <  M.  For  k  >  M  it  is  trivial  that 


(M-n* 


P{t  >k}<  P{t  >  M)  <  e  5  p. 


For  k  =  2  note  that 

P{r  >  2}  =  1  -  p  <  e_p/2.  | 


Corollary.  Let 


M(P)  = 


b"I/2l  if  p  >  rb 

l(p)  1  otherwise. 


Then 


P{t  >k)  < 


(k  — l)2 

for  2  <  k  <  A/(p) 
e-ALJL£lp  otherw  ise. 


■'  ^  -•  ->  v  ^  v  v  v  v  v  w  v  vr/  wvvrarowv; 
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Proof 


Note  that  M(p)  =  min{[p-1/2],  [ ^  J}.  Hence 


(2m2  -  5m  +  1)  (m  -  l)5  2 

- 3 - p+_ 2?~ p 


m=Af(p)+l 

m=Jl/(p)+l 


<1™^  (m-n^  , 

-  2  y  24  ^ 

.  W2  ,  KP)1/5]5  2 

<  i, 

and  so  M{p)  -f  1  does  not  exceed  the  value  A/  specified  in  the  hypothesis  of  Proposition 
1.2.  Thus  it  follows  from  the  proposition,  and  the  fact  that  P{r  >  k }  is  nonincreasing 
with  k.  that  the  proposed  upper  bound  is  valid.  I 

Proposition  1.3.  For  fixed  t  >  0, 

P  <  — ■==  >  t  >  ~  e“<2  as  p  i  0. 

lv/27 P  I 

Proof.  For  p  sufficiently  small,  the  preceding  corollary  may  be  used  to  attain  the 
following  upper  bound. 

p{^rP>,j=f{r>l,'/^i} 

<  exp  |-i({fv/27p]  -  l)2pj 

<  exp{-f2  -  At^/p/2}. 

Similarly,  by  the  corollary  following  Lemma  1.1  (with  a  =  |). 

p{wv>'}  > ^ 

These  two  bounds,  plus  the  sandwiching  theorem  for  limits,  yield 


lim 

p— o+ 


P{r^j2>  t)  = 

e-0 


as  required. 
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It  should  be  noted  that  for  the  special  case  of  the  simple  birthday  problem  the  result  given 
in  the  previous  proposition  is  known  to  Diaconis  [9]  and  probably  others  who  have  studied 

the  problem. 

The  result  stated  in  the  next  proposition  is  of  a  different  form  than  the  more  common 
types  of  collision  problem  results,  as  many  previous  results  treat  P{r  >  A'}  for  either  k 
fixed  or  k  being  constrained  to  be  of  the  order  0(p~1//2).  The  restriction  that  a  be  less 
than  l  arises  from  a  similar  restriction  in  Proposition  1.2.  In  Sections  1.3  and  1.4,  where 
the  distribution  of  r  is  known  exactly,  the  following  results  will  be  strengthened  to  hold 
for  all  0  <  q  <  For  the  special  case  of  a  =  5  this  result  may  be  obtained  for  numerous 
settings  from  a  Poisson  limit  of  Silverman  and  Brown  [52]  Their  results,  which  arise  from 
a  method  involving  the  T-statistics  of  Hoeffding  [21],  will  be  discussed  further  in  the  next 
section. 

Proposition  1.4.  For  0  <  a  < 

P{T  >  b"°l}  ~  exp(-^p1_2Q)  as  p  1  0. 
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3 


Also,  by  the  corollary  following  Lemma  1.1, 

liminf  — — -  >  liminf(l  -  lp~a]p1^2)  exp 
p— 0+  expl-ip1"2*)  -  p— »0+  v  Lr  If  '  r 


>  liminf(l  -  p1^2-a)exp(ip1_2a  -  -p1_2a  ; 

p— »0+  2  2 


so  that  it  may  now  be  concluded  that 

,im  £li>tp-°ii , 

p— o*  exp(-ip1“2°) 

Next  a  lower  bound  for  E[t]  will  be  given.  A  similar  upper  bound  for  E[r\  cannot  be 
produced  (by  the  author)  in  this  general  setting  since  a  sufficiently  tight,  upper  bound  for 
P{r  >  k)  has  not  been  determined  over  a  large  enough  range  of  k.  In  subsequent  sections, 
where  specific  metric  spaces  are  considered,  nice  upper  bounds  for  E[r]  are  found  in  those 
cases  where  the  distribution  of  r  is  known  precisely. 

•t 

Theorem  1.1.  Let  a  €  (3.  §).  Then  for  p  <  5  -  Q 

E^>\frP 

T'  jwer  bound  is  asymptotically  equivalent  to  as  p  i  0. 

Proof.  Since  r  is  a  nonnegative,  integral-valued  random  variable,  it  follows  from  a 
well-known  argument  that 

E[T)  =  jrp{T>k}. 

k= 0 

However,  since  r  >  2  with  probability  1,  it  is  also  true  that 

OC 

£[r]  =  2  +  £P{r  >  k}, 
k=2 

and  so  by  the  corollary  following  Lemma  1.1 

tr-al  , 

£[r]  >  2  +  ^  e~trPCi  -  kP°)- 
k= 2 
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j2 

Since  e~  *  P(1  -  xpa )  is  a  positive,  decreasing  function  of  i  on  (2 ,p  a), 

/■*+!  XJ  /P~“  .2 

£[r]  >  2  +  >  /  e  T P(1  -  xpa)dx  +  /  e“ T P(1  -  xpa)dx 

Jk 

P”°  si 

=  2+X  1  '<■  -  "">*• 

x2 

Since  e~  2  p(  1  -  xp°)  is  less  than  1  on  (0, 1)  and  is  negative  on  (p  00),  it  further  follows 


f00  si 

£[r]  >  /  e  2  P(1  -  xpa)dx 

Jo 


2  p  p1- 


as  claimed.  Also  note  that 


1.3.  The  Birthday  Surprise  Revisited 


*VV  V  V- 


Let  5  =  {1, 2.  ■  •  •.  r)}  and  let 


p(x.y)  = 


\*  -  yl  if  \x  —  y\  <  7 

n  —  |x  —  y|  otherwise. 


Recalling  that  <5  represents  the  minimum  separation  distance  for  disjoint  points,  consider 
first  the  simple  case  where  f  is  taken  to  be  1.  Then  two  points  C,  £  5  and  C}  6  S  will 
collide  if  and  only  if  p(C,,Cj)  =  0,  a  condition  requiring  that  C{  and  C:  be  the  same 
element  from  5. 

The  sequential  selection  of  Ci , C2,  •  •  •  from  S  may  be  thought  of  as  dropping  balls 
labeled  Cj.  C2,  •••  into  cells  labeled  1,2,  •••,«.  For  each  drop,  the  probability  of  the  ball 
landing  in  cell  j  is  just  £  ( j  =  1,2,  •  •  -  ,n).  Also  the  probability  that  two  arbitrary  points 
C,  and  Cj  collide  is  since  whatever  cell  the  ball  C,  falls  into  there  exists  a  1  in  n  chance 
that  Cj  will  occupy  the  same  cell.  Clearly,  the  packing  sequence  consists  of  equivalent 
points. 


i‘tj'Lt’1  ji  J'll 


m 


S‘%  *.  A 
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The  initial  collision  in  the  sequence  will  occur  the  first  time  a  dropped  ball  lands  in 
a  cell  which  is  already  occupied  by  a  previously  allocated  ball,  i.e.  t  counts  the  number  of 
drops  required  to  produce  the  first  double  occupancy.  Problems  concerning  the  distribu¬ 
tion  of  r  are  generally  referred  to  as  birthday  problems.  A  common  example,  sometimes 
called  the  birthday  surprise,  is  to  determine  the  smallest  number  of  people  needed  in  order 
for  there  to  be  at  least  an  even  chance  that  some  pair  of  them  will  share  the  same  birthday, 
i.e.  for  n  =  365  find  the  smallest  value  of  k  such  that  P{t  >  *}  < 

Some  basic  aspects  of  birthday  problems  are  discussed  in  [14],  and  numerous  authors 
have  investigated  extensions  and  generalizations  of  the  simple  versions  of  the  problem. 
McKinney  [37]  calculates  the  probability  that  at  least  r  out  of  n  randomly  selected  people 
have  the  same  birthday,  and  Klamkin  and  Newman  [31]  determine,  asymptotically,  the 
expected  number  of  people  needed  in  order  for  r  of  them  to  have  the  same  birthday.  If 
there  are  n  days  in  the  year,  then  the  expected  number  needed  is  asymptotically  equivalent 


(r!)^T  (l  +  ^  n(1*'> 


as  7i  tends  to  infinity.  For  the  case  of  r  =  2.  their  theorem  vields  the  result 


y'y  as  77  —  00 

where,  as  before,  r  is  the  time  of  the  first  match.  Blaum  et  al.  [6]  present  more  accurate 
asymptotic  estimates  than  tl,  e  given  in  [31],  and  they  also  consider  the  number  of  people 
needed  in  order  to  have  k  different  birthdays  occur  at  least  r  times  each.  Several  other 
variations  are  to  be  discussed  in  a  forthcoming  book  by  Diaconis  and  Mosteller,  among 
them  are  problems  in  which  birthdays  do  not  occur  with  equal  likelihood,  multivariate 
versions,  and  “near"  coincidences.  Among  their  findings  in  each  of  the  settings  investigated 
is  an  approximation  for  the  number  of  people  needed  in  order  for  there  to  be  an  even  chance 
for  a  match. 

Now  consider  the  case  where  6  equals  2.  If  the  n  cells  are  arranged  in  a  circle, 
and  balls  are  dropped  into  them  at  random,  then  r  counts  the  number  of  balls  required 


£ 


4 


a 
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to  obtain  the  first  occurrence  of  either  any  cell  being  doubly  occupied  or  of  any  pair  of 
adjacent  cells  being  singly  occupied.  Clearly 

p  =  P{C,  aC;}  =  ^ 

since  whatever  cell  C,  occupies  there  will  be  a  collision  if  C:  lands  in  the  same  cell  or 
either  of  the  two  cells  adjacent  to  it. 

Another  way  to  view  the  situation  is  as  follows.  Consider  a  circle  of  circumference 
n  which  is  divided  into  n  segments,  each  segment  being  of  arc  length  1.  The  segments 
are  labeled  in  order  1,2,-  •  -  ,n  so  that  the  nth  segment  is  adjacent  to  the  1st  segment.  If 
C,  =  j  n)  then  place  an  arc  of  length  2  on  the  circle  so  that  it  covers  exactly  the  jth 
and  ( j  +  l)th  segments.  If  C,  =  n  then  let  an  arc  cover  the  nth  and  1st  segments.  A 
collision  will  occur  whenever  any  portion  of  two  arcs  are  o%-erlapping. 

This  circular  representation  is  also  convenient  if  b  G  {3. 4,  •  •  •}.  At  each  step,  if  C,  =  j 
then  cover  the  segments  labeled  j,  j  £  1,  •  •  • ,  j  £  (f  —  1)  with  an  arc  of  length  6.  where  £ 
stands  for  addition  modulo  n.  The  initial  collision  occurs  the  first  time  any  portion  of  the 
circle  becomes  twice  covered.  Thus  r  equals  k  if  the  first  k  —  1  arcs  are  pairwise  disjoint 
and  the  Ath  arc  overlaps  at  least  one  of  the  first  k  —  1  arcs.  Note  that  fi(C,.Cj)  =  0  if 
C,  =  Cj.  Otherwise  n(C,.Cj)  is  just  one  greater  than  the  number  of  segments  which  lie 
between  the  C,th  segment  and  the  C;th  segment,  where  the  shortest  possible  portion  of 
the  circle  connecting  the  two  segments  is  considered. 

For  convenience,  in  this  metric  space  the  stopping  time  r  will  be  written  T(,  n,  where 
6  identifies  the  minimum  separation  for  disjoint  points  and  n  is  the  number  of  elements 
in  S.  For  example.  P{r7,365  >  k}  is  just  the  probability,  under  reasonable  simplifying  as¬ 
sumptions,  that  in  a  group  of  k  people  sampled  at  random,  no  pair  of  them  have  birthdays 
less  than  a  week  apart.  This  type  of  birthday  problem  seems  to  have  been  investigated 
first  by  Abramson  and  Moser  [  1] .  and  subsequent  work  on  extensions  has  been  done  by 
Sevast'vanov  [48]  and  Diaconis  [9]  among  others. 

For  another  example,  consider  the  packing  problem  discussed  in  [43]  and  [36].  There 


AdBk£L*fc*«49lLl 
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a  line  of  integral  length  n  is  filled  sequentially  at  random  with  nonoverlapping  intervals  of 
integral  length  a,  their  end  points  having  integer  coordinates.  If  a  selected  interval  were  to 
overlap  one  which  has  already  been  packed,  then  the  interval  is  rejected  and  another  one 
is  selected  at  random.  The  process  continues  until  it  is  no  longer  possible  to  fairly  pack 
another  interval  on  the  line.  Disregarding  end  effects,  the  event  {ra,n  >  k}  corresponds  to 
having  none  of  the  first  k  intervals  rejected. 

For  c  6  {1,2,--*},  rCin  is  a  random  variable  having  {2,3,  •  •  -,  [~]  +  1}  as  its  set  of 
possible  outcomes.  P{rc<n  >  k}  =  1  for  all  integers  k  <  2  and  P{rc-Tl  >  k)  =  0  for  all 
integers  k  >  |[|J.  For  2  <  k  <  [^J.  Abramson  and  Moser  [l]  proved  that 


An  equivalent  form  is 


(1.10) 


[n  -  k{c-  1)  -  l)(n  -  k(c  -  1)  -  2)  -  -  (n  -  k{c  -  1)  -  (k  -  1)) 


Setting  c  equal  to  1  yields 


a  well-known  expression  for  the  probabilities  of  the  basic  birthday  problem. 

The  major  advancement  contained  in  this  section  is  given  by  Proposition  1.10,  which 
states  that  for  0  <  a  <  | 


{T- >  [[(srh) 


as  n  —  oc. 


Note  that  this  result  implies  that,  for  large  n,  only  for  values  of  k  near  \/n  will 
P{Tc.n  >  fc}  be  something  other  than  almost  zero  or  almost  one,  a  fact  which  is  known  by 
Diaconis  [9]  and  perhaps  others  who  have  investigated  the  birthday  problem.  However, 
note  that  the  result  also  provides  a  way  of  estimating  what  the  precise  value  of  P{rcn  >  k } 
is  when  it  is  close  to  0  or  1.  provided  that  q  is  not  too  large. 
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Letting  the  9th  quantile  of  rc-T1.  denoted  by  be  defined  as  the  smallest  integer  £ 
satisfying  P{r  <  >  q,  the  above  proposition  suggests  that  should  be  reasonably 

approximated  by 


(,  = 


(27TT)log<T37) 


if  q  <  1  -  exp 


2  (2,-1) 


1/3' 


.  It  turns  out,  however,  that  the  approximation  is  rather 


good  for  even  larger  q.  This  is  demonstrated  by  the  following  example.  Let  c  =  2  and 

1/3] 


n  =  3000.  Then  1  -  exp 


-»(*) 


=  1  —  e  5  =  .993.  A  comparison  of  P  and  P  for 


various  values  of  q  is  presented  below. 


9 

£9 

£9 

0.5 

38 

38 

0.9 

68 

68 

0.99 

96 

96 

0.993 

99 

100 

0.995 

103 

103 

0.999 

117 

118 

0.9995 

122 

124 

0.9999 

134 

136 

1  -  10~5 

150 

152 

1  -  10"7 

176 

180 

1  -  IQ"9 

199 

204 

It  is  interesting  to  note  that  although  the  difference  between  122  and  124  seems  slight, 
the  difference  between  P{r  >  122}  =  4.88  X  10-4  and  P{r  >  124}  =  3.77  x  10-4  is  more 
pronounced.  Similarly,  the  difference  between  199  and  204  does  not  appear  to  be  nearly  as 
drastic  as  the  difference  between  P{r  >  199}  =  9.3  x  10-10  and  P{t  >  204}  =  3.1  x  10~10. 

For  the  special  case  of  a  =  5,  the  result  of  Proposition  1.10  has  been  obtained  by 
several  investigators.  These  include  Sevast’yanov  [48],  Silverman  and  Brown  [52],  Diaconis 
[9],  and  Stein  [56],  In  fact,  for  this  special  case  it  is  possible  to  state  more  powerful  results 
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than  that  which  is  furnished  by  Proposition  1.10.  These  more  powerful  results  are  generally 
known  as  Poisson  approximations. 

Consider  the  basic  birthday  problem  where  there  are  k  people,  n  days  in  a  year,  and 
it  is  considered  that  a  match  occurs  whenever  a  pair  of  people  have  a  common  birthday. 
For  this  setting  the  Poisson  approximation  loosely  says  that  if  (j)i  is  approximately  equal 
to  some  fixed  value  A  and  n  is  sufficiently  large,  then 

e~  ^  A-* 

P{#  matches  =  j}  =  — . 

]■ 

It  is  not  even  necessary  to  require  that  all  birthdays  occur  with  equal  likelihood,  although 
some  restrictions  on  the  probabilities  are  needed.  Note  that  since  k  is  required  to  be  of 
the  order  0{n 1^2).  tne  j  =  0  version  of  the  Poisson  approximation  is  indeed  similar  to 
the  q  =  \  case  of  Proposition  1.10.  Similar  Poisson  approximations  can  be  obtained  in 
collision  settings  other  than  the  one  discussed  here. 

Now  methods  similar  to  those  of  the  previous  section  will  be  used  to  develop  bounds 
for  P{~c,n  >  k}.  These  bounds  will  in  turn  be  used  to  establish  limit  results  for  the 
distribution  of  rc>n.  including  Proposition  1.10. 

It  follows  from  (1.10)  that 

(1.12)  log  P{rc,n  >£}  =  ^  log  1  -  (—  +  J)  ■ 

J=1  L  \  n  /. 


The  inequality 


log(l  -  x)  <  -x  (0  <  x  <  1) 


may  be  applied  to  each  of  the  terms  on  the  right  hand  side  of  (1.12),  resulting  in 


=  (*  -  i)*(2c~  jj 

2  n 

Thus  the  following  proposition  is  immediately  established. 
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Proposition  1.5. 


<  fe  —  1  .H3C-1  . 

P{rc,n  >  k)  <  e  K 


Noting  that  p  =  ■  c~—  .  the  next  proposition  is  just  a  special  case  of  the  corollary 
following  Lemma  1.1. 

Proposition  1.6.  For  a  real  and  k  an  integer,  if  n  >  3(2c  —  1),  a  <  |  and  k  >  2. 

then 

»*<(*)• 

l  >  0  otherwise. 

An  alternative  lower  bound  is  provided  by  the  following  proposition. 

Proposition  1.7.  Let  c.  n,  and  k  be  integers  such  that  2  <  c  <  1728.  n  >  c3.  and 


k  >  2.  Let 


M(c.  n)  = 


(t)2/3  if  c  <  1  +  ( j)1 
otherwise. 


P{re.n  >  k)  |  >  0  <"-fc(e-l))*>0  eXp  (  2(n — fc(c—  1 ) ) )  *  <  A/(c.  *1 1 

l  >  0  otherwise. 


Proof.  Consider  <  A/(c,n)  since  the  other  case  is  trivial. 
For  convenience  let  *  denote  (c  -  1).  Note  that 


so  that 


or  equivalently 


Therefore,  for  k  <  M{c.  n] 


A/3/2  +  7 A/  <  n. 


A/  <  (n  -  7A/)J/3- 


Jfc  <  (n  -  7lt)2/3. 
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I 

ti 


% 

£ 


It  follows  from  (1.9)  and  (1.1)  that 

PsT  >M  =  _ H IZJDI _ 

*  c‘"  n*-1(n  —  kf  -  k)T(n  -  kf  -  k) 

(n  -  ki)T'-k~,~1/2  exp(-n  +  kf) 


nk~1(n  -  kf  —  k)n~ky~k+}/2  exp(-n  -f-  kf  +  k  + 


i 


1  - 


(£) 

x  exp  (  -k  - 


1 


(»-i 

-Dl  7^ 


12(n-v-r-/c)  ■ 
n  —  fc-v  —  /t  + 1  /2 


12(n  -  kf  -  k) 


The  inequality 


H) 


r  — 1 


>  e-1  (x  >  1) 


applied  to  the  expression  above  yields 

n  -n+ki+k- 1/2 


P{re,n  >  k)  >  1  (n_it7) 


/  (*- 

eXPl-(^ 


■  IHl  _  t _ 

*7)  12( 


n  -  kf  -  kj) 


It  now  suffices  to  show  that  if  k  <  (n  -  7 k )2^3  then 


(  k  V 

-n+k~r+k  — 1/2  ■ 

1  -  (  -  ) 

\n-kfj. 

. 

1  - 


1" 


x  exp 


(n  —  kf  )2/3j 

k2(2f  +  1 )  (Jt  —  1)A*7  .  1 


2(n  -  kf)  (n  -  kf) 
exceeds  1,  or  equivalently  that 


-k- 


12(77  -  kf  -  k)J  ‘ 


(n  -  kf  -  k  +  i)log 


1  - 


k 


-1 


+  log 


{n-  k 7) 
fr2(27  4-  1)  _  (1-  -  1)1-7 
+  (77-67)  (n-kf) 

is  greater  than  0.  By  (1.3)  the  above  expression  exceeds 


1  - 

-  k- 


(77  -  k 7  )2/3 
1 


(n  -  kf  -  k) 


12(n  -  kf  -  k) 

k2( 27  +  1)  (A- -l)A-7 

(n  -  kf)  ^  2(n  -  &7)2J  '  (n  -  kf)2/3  '  2{n  ~  kf)  (77-67) 

-6- 


k 2 


+ 


12(n  -kf-k) 

k  -\2f{n-kf)-k2  +  2(n-kf)A/3]~ 


2(77  -  kf) 

k 

2(77  -  kf)2 


12(77  -  kf  -  k ) 


[2(77-*7)4/3-*2]- 


1 


12(n  -  67  -  k) 


k 


ttSac. 
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which  is  greater  than  1  whatever  y  €  {1,  2,  •  •  •}  may  be.  | 

The  following  proposition  is  just  a  special  case  of  Proposition  1.3. 

Proposition  1.9.  For  fixed  t  >  0, 

p{Tc'n/V^7^T  ><}  ~e"‘5  45  n^oc- 

Since  the  upper  bound  furnished  by  Proposition  1.5  is  valid  over  a  larger  range  than 
is  the  upper  bound  given  by  Proposition  1.2,  the  result  given  in  Proposition  1.4  may  now 
be  extended  over  a  larger  range  as  well.  The  proof  of  the  following  proposition  is  omitted 
since  it  is  entirely  similar  to  the  proof  of  Proposition  1.4. 

Proposition  1.10.  For  0  <  a  <  I, 


p{r‘->  IGrn)  J}~exp f-i(sh) 


as  n  —  oc. 


In  order  to  investigate  the  accuracy  of  the  other  formulas  presented  thus  far  in  this 
section,  consider  as  an  example  the  simple  birthday  problem  where  c  equals  1  and  n  equals 


365.  (1.11 )  becomes 


P{Tl.365  >  k)  = 


-n(*-5fe) 

:= i  x  7 


(it  =  2. •••,365). 


Proposition  1.5  gives 


(1.18) 


P{t i,365  >  k)  <  e' 


{k  =  2. •••,365) 


and  Proposition  1.6  provides 


(1.19) 


[  >  0  (it  >  51)  . 


Proposition  1.9  suggests  the  often  used  approximation. 


(1-20) 


P{r 1,365  >  fc}  »  c  . 


Below  the  right  hand  sides  of  (1.17),  (1.18).  (1.19),  and  (1.20)  are  compared  for  several 
values  of  k. 
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k 

lower  bound 

(1.19) 

exact  prob. 

( 1-17) 

upper  bound 

(1.18) 

approx. 

(1.20) 

5 

0.872 

0.973 

0.973 

0.966 

10 

0.701 

0.883 

0.884 

0.872 

20 

0.351 

0.589 

0.594 

0.578 

30 

0.120 

0.294 

0.304 

0.291 

40 

0.024 

0.109 

0.118 

0.112 

50 

6  4  x  10-4 

0.030 

0.035 

0.033 

60 

0 

0.006 

0.008 

0.007 

It  can  be  seen  above  that  the  upper  bound  is  reasonably  tight,  but  that  the  lower 
bound  is  not.  It  may  also  be  observed  that  the  approximation  given  by  (1.20)  is  neither 
an  upper  bound  nor  a  lower  bound.  It  underestimates  the  true  probability  for  small  %-a!ues 
of  k,  and  overestimates  for  larger  k. 

Putting  c  equal  to  7  and  proceeding  like  before  yields  the  following  results. 

1 

(121)  P{rr.36s  >  *}  =  -fc_T  f] (365  -  6k  -  ;)  (Jb  =  2.- - -.52). 

°  ;=i 


I 
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simple 

improved 

exact 

upper 

>.Y 

lower  bound 

lower  bound 

probability 

bound 

approx. 

k 

(1.23) 

(1.25) 

(1.21) 

(1.22) 

(1-24) 

& 

3 

0.606 

0.794 

0.896 

0.899 

0.852 

5 

0.320 

0.552 

0.689 

0.700 

0.641 

•'  /■ 

10 

0 

0.093 

0.171 

0.201 

0.168 

20 

0 

1.2  X  10"5 

2.3  x  10"4 

1.2  x  10"2 

8.1  x  10- 

i 

30 

0 

1.4  x  10“15 

2.3  x  lO"10 

1.9  x  10_? 

1.1  x  10-' 

Observe  that  the  upper  bound  again  outperforms  the  lower  bounds,  and  also  that 
the  approximation  formula  behaves  similar  to  the  one  in  the  previous  example.  It  is 
interesting  to  note  that  as  poor  as  the  simple  lower  bound  of  Proposition  1.6  (or  equiv¬ 
alently  of  the  corollary  following  Lemma  1.1)  appears  to  be.  it  is  tight  enough  to  help 
establish  the  asymptotic  formula  for  E{r]  stated  in  Theorem  1.2.  which  follows.  The 
looseness  of  the  lower  bound  arises  from  the  fact  that  the  linear  factor  [l  -  k  (^p)°] 
severely  overcompensates  for  the  tail  probabilities  decaying  faster  than  th<=  exponential 
factor  exp  (^~k2  'Y'*)  ■ 

Lower  bounds  for  the  exact  probabilities  in  the  two  examples  above  may  also  be 
produced  from  the  corollary  following  Lemma  1.2.  The  values  obtained  (which  are  not 
reported  here)  are  at  least  as  good  as  those  given  by  (1.19)  and  (1.23)  for  all  values  of  k: 
however,  while  the  values  gotten  from  the  corollary  are  better  than  those  given  by  (1.25) 
for  some  values  of  k,  they  are  worse  for  other  values  of  k. 

Now  a  result  for  the  mean  will  be  derived  using  previous  results  for  the  distribution 

of  r. 

Theorem  1.2.  For  a  packing  sequence  consisting  of  equivalent  points,  if 


(fc-n2 

P{t  >  k)  < 


1 
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for  k  >  2.  then 


Proof.  Bv  Theorem  1.1 


£iTl  V2 P  **  Pl°’ 


£M 

lim  —j=d  >  1. 
p— o*  z. 

V 


Recalling  from  the  proof  of  Theorem  1.1  that 


£[r]  =  2+  £p{r  >  k). 


it  follows  that  if  P{r  >  k)  <  exp  —  1  ~ —Pj  then 

e\t]  ^ (2^Er=,f-*5/2F) 

lim  — ==  <  lim  - P= - 

p-0+  '  -0+  y/i 

=  lim  M Ye-$’ 
p_°*  V  tr  £ 

<  lim  r  e-'-r’di 

p—o*  \  *  Jo 

=  lim  1 

p— o* 


This  result  and  (1.26)  together  imply 


£M  =  j 


^0+  x/5 


as  required  to  complete  the  proof.  ■ 


Corollary.  £[rc-n] 


as  n  —  cx. 


Another  method  may  also  be  used  to  obtain  an  asymptotic  result  concerning  E[tc  „]. 
Asymptotic  results  for  the  mean  are  also  found  in  [6]  and  [31]. 

Let  the  sequence  of  random  variables  be  defined  by 

<  =  ^  (n  =  2. 3.  ■  ■ 
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It  follows  from  Proposition  1.5.  Proposition  1.6  and  calculus  that  for  any  o  >  0 

limP{r;  >  q)  =  e~™2 

n 

where  *>  =  2^1.  Hence  r'n  converges  in  law  to  a  random  variable  having  Maxwell's 
distribution.  Since 


=  0. 


the  random  variables  rr>  are  also  uniformly  integrable.  Thus  it  now  follows  i  see  [3' /  that 
E'jn]  converges  to  the  mean  of  Maxwell's  distribution,  i.e. 


r 


V  2(  2c  -  1 )  ’ 


1.4.  Random  Arcs  On  a  Circle 


In  the  setting  of  the  previous  section,  if  c  is  taken  to  be  very  large  and  n  considerably 
larger,  then  the  packing  sequence  of  placing  arcs  of  length  c  uniformly  on  a  circle  of  length 
n  so  that  their  endpoints  have  integer  coordinates  is  very  nearly  the  same  as  placing  arcs 
of  length  j  on  a  circle  of  unit  circumference  with  their  midpoints  chosen  according  to  a 
uniform  (0. 1]  distribution.  So  in  a  sense,  collision  problems  for  random  arcs  on  a  circle 
are  just  continuous  versions  of  birthday  problems,  with  one  of  the  more  basic  problems 
being  to  determine  the  probability  that  a  set  of  k  random  arcs  will  be  pairwise  disjoint. 

In  terms  of  the  notation  of  Section  1.  the  packing  sequence  for  arcs  of  length  a  on  a 
circle  corresponds  to  putting  5  =  (0. 1], 

f  k  -  y\  if  I*  -  y|  <  5 

pfx.y\  =  i 

I  1  —  \x  -  yj  otherwise. 

and  t  ■=  a.  The  packing  sequence  has  equivalent  points 
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The  distribution  of  the  time  to  the  initial  collision.  ra,  may  be  obtained  from  the 
work  of  Stevens  [57j.  It  is  found  that  for  0  <  a  <  1  and  k  an  integer. 


(  1 


P{ra  >  k)  = 


(1  -  ka)k~l 


0 


if  it  <  1 

if  1  <  it  <  fa"1] 
if  k  >  [a-1]. 


The  essential  step  in  Stevens'  proof  is  the  establishment  of  a  one  to  one  correspon¬ 
dence  between  configurations  of  k  arcs  having  no  overlaps  with  configurations  of  arcs  for 
which  none  of  the  endpoints  Ci .  C 3.  ■  ■  • .  C*  lie  in  a  specified  region  of  length  ka.  Since  the 
probability  that  k  -  1  random  points  fall  outside  of  a  region  of  length  ka  is  just 


(1  -  ka  )*“' . 


the  above  formula  follows 

In  contras;  to  the  unwieldy  product  formula  of  the  previous  section,  the  expression 
(1  -  ka)K~l  can  be  easily  computed  with  a  hand  held  calculator  even  if  k  is  very  large. 
Nevertheless,  for  the  purpose  of  establishing  asymptotic  results  it  is  convenient  to  derive 
upper  and  lower  bounds  for  P{r 2  >  k}. 

It  should  also  be  mentioned  that  the  asymptotics  of  Stevens'  formula  have  been 
thoroughly  studied  because  of  its  connections  with  time  series.  One  such  interesting 
connection  is  du»  to  the  following  fact: 

where  A'i A'ic  are  i. i  d.  exponential  random  variables  and  S*  =  A, . 

Proposition  1.11.  For  a  €  f  0.  ^ )  and  k  an  integer  satisfying  2  <  k  <  a-1, 

(1.27)  P{Ta  >  *}  < 

(1.281  P{ra  >  k)  > 
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and.  with  the  additional  stipulation  that  a  <  g. 

f  e-a*(t-i,(1  _  a2/3^.)  iffc<Q-2/3 
(1.29)  P{ri>k}>l 

l  0  otherwise. 

Further,  the  lower  bound  (1.28)  is  greater  than  the  lower  bound  (1.29). 
Proof.  The  double  inequality 


(1  -  r)i  <  e-1  <  (1  —  x)'1^  (0  <  r  <  1 ) 


may  be  used  to  establish  (1.2T)  and  (1.28)  as  follows: 

P{ra  >  k}  =  (1  -  ka)k-' 

=  [(1  -  itQ)^]*a(*-1) 


<  e 


-ak[k-l) 


and 


P{~a  >  k)  =  [<1  -  W*3"1 j 


kai  >-  1  ' 

1  ~fca  ll  .1 


>  €  . 

To  prove  ( 1.29)  it  is  enough  to  show  that  for  all  appropriate  k  and  a. 


log  >>-*•>  [e-ak<*r-i>(  i  _  a2/3k))-'  | 
Using  (1.3)  it  follows  that  the  left  side  above  exceeds 


>  0. 


-  {kZ})(ka.]l  +  aV3k  +  la4/3k2  +  la2k 3  +  Iq8/3A.4 
( 1  —  ka )  2  3  4 


>  -J¥-  +  a^k  +  -a*'3k2  +  \a2k3  +  \a*'3k< 
1  -  a1/3  2  3  4 


>a2'3k  +  IaV3*2  5  2fc3  +  j. a8/3jt4 

2  3  4 


It  now  suffices  to  have 


-a2^3k  -  -a*^3k2  +  -a2k3  >  —1 
2  3  4 


for  all  A-  such  that  2  <  k  <  a~2/3.  It  is  not  hard  to  show  that  all  points  on  the 


1,5,  1 

V=  7*3-  r*2  + 


curve 
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for  which  x  €  (0.1)  lie  above  the  line  y  —  —1.  which  is  sufficient  to  complete  the  proof  of 
( 1.29;.  Note  that  the  claim  following!  1.29 j  has  been  proven  as  well.  | 

The  proof  of  the  following  proposition  is  omitted  since  the  results  are  easily  estab¬ 
lished  using  (1.27)  and  (1.29).  Also  (i)  is  just  a  special  case  of  Proposition  1.3. 

Proposition  1.12.  (i)  For  any  integer  t  >  0. 

P{aira  >  t)  —  e~‘2  as  a  j  0. 

(ii)  For  any  o  €  1 0. 2/3 ) . 

P{ra  >  (a-0]}  ~  e~a'  2°  as  a  1  0. 
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Table  A 

a  k k0(a)  (1  -  akja))^-1  e~ak°^ 

IO"2 

10 

0.387 

0.368 

IO-3 

32 

0.365 

0.359 

D 

II 

10-“ 

100 

0.370 

0.36S 

IO-5 

316 

0.369 

0.368 

IO"6 

1000 

0.368 

0.368 

10-2 

16 

7.31  x  IO-2 

7.73  x  10" 

IO"3 

63 

1.77  x  10~2 

1.89  x  10- 

C0l*0 

II 

a 

10~4 

251 

1.74  x  10~3 

1.84  x  10- 

10~5 

1000 

4.36  x  IO"5 

4.54  x  10" 

10-e 

3981 

1.27  x  IO"7 

1.31  x  10- 

io-2 

18 

3.43  x  IO'2 

3.92  x  10- 

10“3 

75 

3.12  x  IO"3 

3.61  X  10- 

°  =  i 

io-4 

316 

4.05  x  IO-5 

4.61  x  10- 

10~5 

1334 

1.68  x  IO-8 

1.87  x  10- 

10~7 

23714 

3.54  x  IO"25 

3.78  x  10“ 

IO'2 

22 

5.42  x  IO-3 

7.91  x  10- 

°  =  3 

IO”3 

100 

2.95  x  IO-5 

4.54  x  10' 

IO-4 

464 

2.80  x  IO-10 

4.46  x  10“ 

10-7 

46416 

1.65  x  10~94 

2.71  x  10" 

10~2 

25 

1.00  x  IO"3 

1.93  x  10- 

Q  =  To 

10~3 

126 

4.89  x  IO"8 

1.27  x  10- 

10~4 

631 

1.47  x  10~18 

5.11  X  10- 

COIT 

11 

0 

10~2 

32 

6.42  x  IO"6 

3.57  x  10 

10~3 

178 

8.56  x  IO-16 

1.74  x  10- 

nVV .1.-  \ 
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The  results  shown  in  the  table  suggest  that  the  approximation  formula 

P{ra  >  ft}  *  e~ak2 

works  rather  well  for  k  <  a-06,  and  also  performs  reasonably  for  the  case  of  a  =  0.625. 
The  approximation  seems  somewhat  disastrous  for  a  >  however,  the  case  of  a  =  3  is 
not  nearly  as  severe  as  the  cases  for  which  a  >  |.  Similar  results  for  cases  with  a  =  0.65 
and  a  =  0.66  have  the  ratio  r0(a)  tending  slowly  to  1  as  expected.  Thus  the  requirement 
that  q  be  less  than  |  in  part  (ii)  of  Proposition  1.12  doesn’t  seem  to  be  needed  only 
because  a  sufficiently  tight  lower  bound  is  not  known  for  P{ra  >  k },  rather  its  necessity 
seems  to  reflect  the  apparent  fact  that  the  deviation  from  exponentiality  in  the  tail  of  the 
distribution  of  ra  tends  to  become  more  pronounced  for  values  around  a-2/3  or  greater. 

The  following  proposition  follows  immediately  from  (1.27)  and  Theorem  1.2  since  for 
random  arcs  of  length  a  on  a  circle  of  circumference  1.  p  =  2a. 

Proposition  1.13.  £[ra]  ~  \  \f^  as  a  1  0. 

The  results  of  this  section  may  also  be  applied  to  the  problem  of  spacings  on  the  unit 
circle  or.  if  end  effects  are  ignored,  on  the  unit  interval.  Suppose  that  C} .  Cj.  •  •  • .  C\.  are 
selected  at  random  from  (0.  lj.  Form  the  order  statistics 


C(l)  <  C(2)  <  •  •  •  1  £”(*)• 


and  define  the  spacings  5i.S2.---.5it  by 

C(D  “  c(n)  +  1  for  ;  =  1 


S,= 


Let 


C(;)-C0-i)  for;  =  2. 
Mk  =  min{5i,  5j.  ■  •  -  ,  5*}. 


Then 


P{Mk  >  a)  =  P{ra  >  k). 

As  an  example,  it  may  be  concluded  from  the  results  in  Table  A  that  the  approximation 

P{Mk  >a}*  t-aki 
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should  be  fairly  good  whenever  a  is  less  than  min{L--5/3. 10  2). 

Holst  [22]  obtains  several  nice  results  concerning  the  random  variable  M He  also 
examines  the  length  of  the  jth  smallest  spacing,  a  quantity  which  is  related  to  the  prob¬ 
ability  of  j  collisions  occurring.  In  [23],  Holst  studied  the  asymptotic  behavior  of  the 
distribution  of  the  largest  spacing,  which  is  directly  related  to  the  probability  that  a  circle 
is  completely  covered  by  random  arcs  of  equal  length.  A  review  of  other  results  concerning 
spacings  is  contained  in  [46]. 

1.5.  Arcs  of  Unequal  Length 

This  section  deals  with  sequences  of  random  arcs  having  fixed,  but  not  necessarily 
equal,  lengths.  The  arcs  have  clockwise  endpoints  Cj ,  C2,  ■  ■  ■  which  are  independently 
selected  from  a  circle  of  unit  circumference  according  to  a  uniform  distribution.  The  first 
arc  has  fixed  length  aj.  the  second  arc  has  length  02,  and  so  on. 

In  this  setting,  two  arcs  will  collide  whenever  any  portion  of  them  overlap.  Note  that 
the  dual  interpretation  of  saying  that  a  collision  occurs  whenever  two  selections  from  the 
space  are  separated  by  a  distance  less  than  some  fixed  f  >  0  is  no  longer  applicable. 

Letting  r  denote  the  time  to  the  first  overlap,  it  is  easy  to  see  that  for  a!  4-  a2  <  1 

P{r  >  2}  =  1  -  Q2  -  (22- 

A  simple  inclusion-exclusion  argument  yields  that  if  ai  +  02  +  03  <  1  then 

P{r  >  3}  =  (1  -  ai  -  a2  -  a3)7. 


Likewise,  for  aj  +  a2  4-  a3  +  a*  <  1 

P{t  >  4}  =  (1  -  01  -  a2  -  a3  -  a«)3. 

The  calculation  of  this  last  usult,  or  any  subsequent  result,  by  brute  force  is  extremely 
tedious;  however,  a  slight  modification  of  a  clever  argument  due  to  Stevens  [57]  establishes 
that 


P{T>k}  =  [(  1  -«*)+]*"' 


3 


i 


(1.30) 


.  v_v> vx.'*  .■>  ,v\-_vv 
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for  k  >  2.  where  s*  =  ai  + - 1-  a*.  (1.30)  may  also  be  obtained  by  appealing  to  a  result 

due  to  Marsaglia  [39]  and  de  Finetti  [8].  Their  result  states  that  if  (A'j.AV  •  •  •,  A'n)  is  a 
random  point  on  the  simplex  {x  6  IRn  :  x,  >  0,ii  + - (-  xn  =  1},  then 

P{ A'i  >  A'n  >  an}  =  [(1  -  a, - a„)+ f"1. 

(1.30)  follows  immediately  since  it  can  be  shown  that  the  joint  distribution  of  the  set  of 
spacings  which  occur  between  n  points  independently  and  uniformly  selected  from  a  circle 
of  unit  circumference  is  uniform  on  the  simplex  given  above  (see  p.  76  of  Feller  [15]). 

An  interesting  consequence  of  (1.30)  is  that  P{t  =  fc}  depends  only  on  n*  and  Sk-\ 
and  not  on  the  individual  values  of  d\.  ■  •  ■  .Uk-i-  This  may  be  seen  through  the  equation 
P{r  =  k}  =  P{t  >  k  -  1}  -  P{r  >  k } 

=  (1  —  Sk~i)k~2  -  (1  —  Sk-i  —  at)*-1. 

t  can  be  made  to  have  numerous  interesting  distributions  by  choosing  aj.cjT.--- 
in  certain  ways.  In  the  examples  given  below,  several  cases  are  examined  ,n  which  the 
sequence  of  arc  lengths  is  either  monotonicallv  increasing  or  decreasing.  In  the  first  case 
presented,  the  prescribed  sequence  of  lengths  has  the  time  of  the  first  collision  being 
equally  likely  over  a  range  of  values. 

Uniform  Collision  Time 

If  a i .  q2.  •  ■  am  are  such  that 

«1  +  a2  =  (m  —  1)_1, 


for  k  =  3,4,  •  •  -  ,m  —  1.  and 


.  and 


P{t  =  2}  =  P{r  =  3}  =  ■■■=  P{r  =  m}  = 


m  -  1 


E[r  ]=f +  1- 


WM 


Jj 


.«  Jui 
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Geometrically  Decreasing  Arcs 

Now  consider  the  sequence  of  arc  lengths  given  by 


Ok  =  lk  lo 


where  ■)  £  (0.1)  and  0  <  a  <  5.  For  k  >  2, 


Sk 


so  that 


P{r  >  it}  = 


k-\ 


{k  =  2, 3.  ■  • 


Some  asymptotic  results  for  this  setting  are  given  in  the  following  theorem 
Theorem  1.3.  If  <n-  =  ** k~la  where  -y  £  (0.1)  and  0  <  a  <  5.  then 

(i)  P{T  <  k)  ~  (k  -  1)  (737)  a  as  a  i  0 


and 


(ii)  E[r) 


1  — -V 


as  a  1  0. 


Proof.  The  following  two  bounds  can  clearly  establish  (i). 

<  <*  - »  (ttt) 

and  for  a  <  4  -  1) 

P{t  <  k}  =  1  -  exp  j(*  -  l)log  |l  -  ^ ~  j  a  j 
>  1  -  exp  j-(*  -  1)  ^ ~  ^  'j  o  | 

><k~  !)  (ttt)  “  "  5  [<<: '  11  (VrT )' 
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Now  (ii)  will  be  established 


£[r]  =  £  P{r  >  kj 

4=0 


=  2  -t 

>2  + 


kzz  2  L 

x  ✓  \  it- 1 

£  -rfr) 

j _ x  '  7 


=  1  + 


4-1 


and  so 
(1.31) 


Um  inf  1  >  1 . 

— o*  <^1 


Also,  for  any  rv  >  3  and  a  sufficiently  small. 


£lr’  =  2  -e 


**<■"-»'+£  [.-(ttt)' 

*=m  1  X  ' 


/r-1 


=  777  ± 


<  m  + 


1 


l  — * 


)G 


(—)- 


so  that 

£[r]  1 

limsup-i—  <  • 

B-0+  (—  )  1  ' 

Since  the  above  inequality  holds  for  all  m  >  3,  it  must  be  true  that 

fir] 

limsup  -j—  <  1. 

B-0+  (  —  ) 

This  result  and  (1.31)  together  imply  that 


£[r]~ 


as  a  — *  0+ . 
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Displayed  below  are  a  few  exact  values  of  £[r]  along  with  their  corresponding  asymp¬ 
totic  approximations  for  the  case  of  p  =  Note  that  the  lower  bound  1  +  provides 


s 


x-W’.1  ■JW.W*1>',>’.»',.»’,,.,j 
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an  estimate  of  E  ~  which  is  even  more  precise  than  the  asymptotic  form  gi\en  by  tl 
theorem  above. 


Q 

£[r] 

1 

a 

0.1 

6.1401 

5 

0  01 

51.019 

50 

0.002 

251.00 

250 

Uniformly  Increasing  Arcs 

Now  instead  of  a  decreasing  sequence  of  arc  lengths,  consider  the  case  where  t lie 
lengths  increase  according  to 

ak  =  ka. 

Then 


f  fi  _  kJE±lla\k~' 

P{r  >  A-}  =  |  l  2  aj 


(2  <  Hr  <  M(a)) 

(k  >  A  /(a)) 


where 


A/ ( a )  = 


and 


Asymptotic  results  concerning  r  are  provided  by  the  following  theorem. 

Theorem  1.4.  If  ak  =  ka,  then 

(i)  P  |(|)1/3r  >  t|  ~  e-'"’  as  a  JO 

(»)  E\r}  ~  (j)'/3r  (5)  as  o  1  0- 

Proof.  For  k  <  M(a) 


log  P{r  >*}  =  (*—!)  log 


,  fc(fc  +  l)  ' 

1 - - - a 


( k  -  1  )k(k  +  1) 

<  ^  a 
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so  that 


r  ik  -  D3 

P{r  >  k)  <  exp  I - - - a 


Then  for  a  sufficiently  small 


War-- 


}*4 


— - a  (2  <  k  <  SI  {a )). 

rr  /  2  \ 1/3  n ) 


<exp(~i '  'll-1) 


<exp  ri((!) 7  "2 


f  7  /< 

l\l/3  ,  /< 

J\2/3  } 

exp  <  -r  +  6  ( 

-)  r-l2(- 

-)  (  +  4o  I 

P{(  f  )1/3f  >  <} 

Iimsup - i - -  <  1. 

a— •O'* 


fl  ^  miri  {slT^TT-  VW*  then 

P{r  >  k}  =  exp  j(*  -  l)log  1  -  kl~^  -K  j 

f  ,  [  *(*  +  11  fk(k  a-  i)  \2 

>  exp  |  (*  -  1) - - - a  -  ^ - - - a) 

(*£*±11.)’' 

>  1  -  y«J]  • 

follows  that  if  a  is  sufficiently  small  then 

>  exp"*3  -  (4a)1/3f5j 


'’{(ir— ’ 


i  truji 

.  fP{(|l,/3r  >f)  ^  , 

lim  inf - -j -  >  1. 

a— o+  e"f 

ch  when  combined  with  a  previous  result  establishes  (i). 


*T» 


V  V  V  V  .* 
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The  hounds  found  above  vield  that 


E[r)  =  2+  JT  £{r  >  k} 


A/Ul-1 


<2.  £  e-^' 

=-arrG) 


fV.  =  2+  />fr  >  *} 

k=2 

r\t(3)  3  /  r5  \ 

>/«  <-v-0 -H 


Now  for  a  sufficient!)  small,  yd^  >  1  for  all  7  >  .\/(a).  Therefore,  when  a  is  small  enouc' 


^gh 


These  upper  and  lower  bounds  for  £ V]  can  be  used  to  easily  establish  (ii).  I 


The  accuracy  of  the  approximation 


is  demonstrated  below  for  various  values  of  a. 


\-yy-y- 
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F'r]  (f|1/3r(^ 


IS 

3. 197 

2.775 

1 

ss 

4.672 

4.279 

1 

32S 

8.102 

7.736 

i  9ts  x  i<r4 

19.65 

19.30 

4  9*  x  10-' 

30.93 

30.59 

7 .98  x  10-6 

56.63 

56.29 

2.00  x  10"* 

89.66 

89.33 

5  00  x  10_* 

142.1 

141.8 

2.22  x  10“" 

186.1 

185.8 

Quadratic  Growth 

If  the  arc  lengths  are  given  b> 

ak  a  k‘a 


then 


P{f  >  k 


*»■( 


frU-  +  lU2t  +  1) 

1  -  - 7 - O 


t-1 


for  2  <  k.  For  a  not  too  large,  the  following  bounds  are  true: 

P{t  >  k)  <  exp  (-^") 
and 


P{r  >  k }  >  exp  (- ~(k  +  l)4) 


1  -  —  (k  +  1)' 


These  bounds  can  be  used  to  prove  the  following  theorem  (whose  proof  is  omitted)  by  a 
method  which  is  entirely  similar  to  that  which  is  used  in  the  proof  of  Theorem  1.4. 


Theorem  1.5.  If  ak  =  k2a,  then 
(i)  P  j(§)1/4  r  >  tj  ~  f-'*  as  a  j  0 


N 

\ 


s 


E 


s 


0 


<■ 


w 


and 


di'  E[r] 


(2),/4r(|]  as  a  10 
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Displayed  below  are  a  few  exact  values  of  Ej  along  with  their  corresponding  ap¬ 
proximations 


a 

£(rj 

3  48  x  lO”4 

9.003 

8.731 

2.96  x  10'6 

29.03 

28.77 

3.00  x  10-9 

161.5 

161.2 

Exponentially  Growing  Arcs 

As  a  final  example,  consider  the  sequence  of  arc  lengths  given  by 


a*  =  Tk  ‘a  (•>>!). 


so  that  Sk  increases  geometrically.  Then,  for  it  >  2 


'!->*>  =  (H^fr)°Lr 


Neither  a  tight  upper  bound  nor  a  tight  lower  bound  is  readily  established  for  £V; 
however,  it  is  found  that  £(r]  can  be  closely  approximated  by  a  quadratic  polynomial  in 
logo  whose  coefficients  depend  on  7.  For  instance,  if  7  =  2  then 

E[r]  =  f2(a)  =  (8.811  x  lO^Klogo)2  -  1.335 logo  -  3.10. 

and  if  7  =  3/2.  then 

£[r]  =s  /3/2(o)  =  (5.065  x  10"3)(logo)J  -  2.107 logo  -  6.36. 

The  closeness  of  these  approximations  can  be  seen  in  the  results  given  below.  In  each  case 
the  fit  was  done  using  more  than  twenty  points,  and  the  comparisons  presented  below  are 
just  representative  samples  given  to  show  the  accuracy  of  the  approximation 


St 

cltou  1  6  .- 

a 

E'f 

fii  a  i 

9.78  x  10~* 

7.14 

6.20 

9.54  x  10"r 

15.79 

15.58 

9.31  x  lO"10 

25.08 

25.05 

9  09  x  10*13 

34.60 

34  60 

8.88  x  10* 

44.23 

44.23 

a 

E'r) 

2  01  x  10*fe 

21.64 

21.57 

3 43  x  10*? 

26.15 

26.13 

4.52  x  10*’' 

30.73 

30.73 

5.95  x  10*9 

35.37 

35.37 

7.84  x  10*10 

40.05 

40.05 

1.6.  Arcs  of  Variable  Length 

This  section  considers  another  generalization  of  the  packing  sequence  described  in 
Section  4.  Here  each  selection  from  the  space  5  =  (0.  l]  will  be  taken  to  be  the  clockwise 
endpoint  of  an  arc  having  variable  length  on  a  circle  of  unit  circumference.  The  length  of 
each  arc  placed  will  be  an  independent  observation  from  a  distribution  F.  and  a  collision 
will  occur  whenever  any  pair  of  arcs  are  not  disjoint.  The  related  covering  problem,  where 
random  arcs  of  variable  length  are  placed  on  a  circle  until  its  circumference  is  completely 
covered  has  been  studied  by  several  investigators  (see  [24]  and  [51]  for  example);  however, 
the  collision  problem  discussed  here  does  not  seem  to  have  received  much  attention. 

Let  the  sequence  of  arcs  be  labeled  and  let  arc  A,  have  clockwise  end¬ 

point  C,  and  length  L,.  Thus  C1.C2.- -  •  are  i.i.d.  random  variables  with  a  uniform  (0. 1] 
distribution  and  -  are  i.i.d.  random  variables  having  c.d.f.  F.  Furthermore,  the 

Z.,'s  are  independent  of  the  C,  "s. 


.sV.st'.’ 


rvvwvw»n 


'  I 
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Now  let 


Sg  —  L\  t  ■  ■  ■  +  Lg 


and  let  Fg  denote  the  distribution  of  Sg.  Let  mF  denote  the  mean  of  a  random  variable 
having  distribution  F,  and  as  was  the  case  in  the  previous  section  let  r  give  the  time  of 
the  first  overlapping  of  arcs. 


The  following  theorem  provides  a  lower  bound  for  P{t  >  k }  in  the  setting  describee 


above. 


Theorem  1.6.  For  a  sequence  of  random  arcs  having  lengths  L\.  L2.  ■ 


P{t  >  k]  >  [(1  -  kmF)+]k 


for  anv  choice  of  F. 


Proof.  It  follows  from  ( 1.30)  that  for  k  >  2 


( 1.32) 


P{r  >  k}  =  J  P{r  >  k  1  5*  =  s)dFk,x  \ 
=  j[(l  -  s).]k-'dFg<s). 


Then  using  Jensen's  inequality,  it  follows  from  ( 1.32)  that 

P{-  >  k}>Ul-  E[Sg])+]k-\ 

(1.33) 

=  [(1  -  km/r  u}k  1  | 

Note  that  a  consequence  of  (1.33)  is  that  £[r]  for  a  packing  sequence  of  arcs  having 
variable  length  prescribed  by  F  is  greater  than  or  equal  to  E[r]  for  a  packing  sequence  of 
arcs  all  having  length  mF . 

Unfortunately.  (1.32)  is  not  necessarily  easy  to  evaluate  exactly,  since  for  many 
choices  of  F  the  distribution  Fg  is  troublesome  to  obtain.  However,  notice  that  if  £  ( i )  =  1 
then 

P{r  >  2}  =  £[1  -  S2] 

=  1  -  2 £[!,]. 
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where  fk  is  the  density  for  the  distribution  Fk .  The  above  formula  for  Fk(s)  ws  originally 
obtauned  by  Laplace  [33] . 

It  should  be  noted  however,  that  the  evaluation  of 

j [d  -  * )+]*■' A(«)* 

becomes  quite  tedious  for  larger  values  of  k.  Alternatively,  for  Jt  <  I  one  may  use 

P{r  >  k}  =  b~k  f  f  ■  ■  ■  I  i  \  -  I\  -  x-2  -  ■■■  -  Xk)k~X di\di2  ■  ■■  di k- 

Jo  J o  J o 


As  another  example,  suppose  the  arc  lengths  are  distributed  such  that 

a  w.p  q 


■-{ 


b  w.p.  1  -  q 


where  a  <  b  Then  for  k  >  2. 


1 1 .34  ' 


P{~  >  k)  =  JVl  -  ja-  (k-  j)bU)k~l  f*V(l  -  q)k~3 . 

;=0 


Now  some  comparisons  will  be  made  between  various  values  of  P{r  >  k)  and  £V 
arising  from  different  distributions  all  having  the  same  mean.  Consider  eleven  distribu¬ 
tions,  G\,  ■  ■  ■  ,G ii.  each  having  mean  ygg.  Let  G\  be  the  trivial  distribution  which  assigns 
probability  1  to  the  outcome  y^.  Let  Gj  be  the  uniform  distribution  on  ( 200  •  2^0]  •  anc^ 
let  G 3  be  the  uniform  distribution  over  (0,  .  Let  G 4  assign  equal  probability  to  each  of 

the  values  ^  and  333.  and  let  Gs  give  equal  probability  to  the  outcomes  yg^  and  yg|g. 
Let  G&  be  such  that  if  L  ~~  G6  then  P  {L  =  735}  =  2/3  and  P  {L  =  gg}  =  1/3.  and  let 
G7  be  such  that  if  L  ~  Gr  then  P  {L  =  555}  =  5  and  P  {L  =  Jy}  =  3.  Each  of  these 
first  seven  distributions  has  a  variance  that  is  no  greater  than  8.1  x  10-5. 

The  variances  of  the  next  four  distributions  increase  from  a  value  of  4.725  x  10-4  for 
Gg  to  a  value  of  2.34  x  10-3  for  Gu-  Gg  is  such  that  if  L  ~  Gg  then  P  {L  =  yggg}  =  733 
and  P  {L  =  =  y|y.  and  G9  is  such  that  if  L  ~  G9  then  P  {L  =  jobs)  =  §4§  and 

P  { L  =  | }  =  dyg.  Let  G10  be  such  that  if  L  ~  G10  then  P  {L  =  j—  }  =  and 
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P  [L  =  §}  =  Finally,  let  Gn  be  such  that  if  L  ~  Gu  then  P  {L  =  and 

Let  Tj  denote  the  time  of  the  first  collision  for  the  packing  sequence  of  random  arcs 
having  lengths  distributed  according  to  Gy  Then  for  j  =  1 ,  -  ■  • ,  1 1 ,  P{t1  >  2}  =  0.98. 
The  following  table  gives  the  exact  values  of  P{t:  >  3}  and  P{'j  >  4}  for  j  =  1.-  •  •,  11. 
as  well  as  estimates  of  the  E[r}]  based  on  100,000  computer  simulation  trials  for  each 
distribution.  (£[ri]  is  exact.) 


j 

P{r3  >  3} 

P{r:  >  4} 

E[Tj\ 

1 

0.940900 

0.88474 

9.622 

2 

0.940925 

0.88483 

9.636 

3 

0.941000 

0.88512 

9.687 

4 

0.940975 

0.88502 

9.694 

5 

0.941143 

0.88567 

9.823 

6 

0.940967 

0.88499 

9.685 

t 

0.940967 

0.88499 

9.680 

8 

0.942318 

0.89010 

11.05 

9 

0.944178 

0.89686 

13.S2 

10 

0.944264 

0.89718 

14.23 

11 

0.947920 

0.90954 

21.60 

It  may  be  seen  that  the  values  of  £[r;]  for  the  six  nonconstant  distributions  having 
small  variances  are  not  too  much  greater  than  E[t{\.  For  the  four  distributions  with  large 
variances,  the  values  of  £[rj]  increase  as  the  variances  increase,  and  they  are  considerably 
larger  than  E[ti\.  Also,  it  is  apparent  that  £[r]  is  not  asymptotically  proportional  to 
(P{A1AA2}r1/2,  as  was  the  case  when  all  of  the  arcs  had  the  same  length. 

When  comparing  arc  length  distributions  having  the  same  expectation,  it  is  conjec¬ 
tured  that  E[t]  increases  as  the  variance  does.  So  with  a  large  variance  the  arcs  tend  to 
avoid  each  other  and  thus  cover  the  circle  at  a  faster  rate  than  arcs  of  smaller  variance 
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do.  This. is  consonant  with  the  conjecture  examined  by  Siegel  [49]  and  Huffer  [24]  dealing 
with  the  coverage  of  the  circle  by  random  arcs.  They  conjecture  that  when  comparing 
arc  length  distributions  having  the  same  expectation,  that  if  one  concentrates  more  mass 
near  the  expectation,  then  the  corresponding  coverage  probability  will  be  smaller.  In 
other  words,  they  claim  that  a  smaller  variance  gives  a  slower  rate  of  coverage.  But  this 
suggests  that  the  arcs  may  tend  to  overlap  each  other  more  frequently,  which  would  mean 
that  their  conjecture  is  somewhat  consistent  with  the  observation  that  E[r ]  decreases  as 
the  variance  does. 

An  Unusual  Case 

An  interesting  case  of  a  two  outcome  distribution  is  where  one  of  the  outcomes  equals 
zero,  and  the  other  outcome  is  such  that  E[L,}  =  m.  That  is,  suppose 

(0  w.p.  q 

A  w-p-  1  - «?' 

where  0  <  m  <  1  —  q.  A  collision  occurs  whenever  any  two  arcs  of  positive  length  overlap, 
or  whenever  a  point  arc  of  zero  length  is  covered  by  an  arc  of  positive  length.  An  easy 
argument  establishes  that  (1.34)  remains  true  even  for  this  degenerate  case.  The  following 
theorem  is  a  statement  of  this  fact. 


Theorem  1.7.  Suppose  arc  lengths  L\,L2,-  -  are  distributed  according  to  (1.35) 


Then 


p{T>k)  =  Y,(K  1  -(fc-»7“]+) 


Proof.  Let  Hkj  be  the  event  that  exactly  j  of  the  first  k  arcs  have  length  zero.  Then 


(1.36) 

Now 

(1-37) 


p{r  >  *} = >  *i  -  q)k~} ■ 

n  \J / 


J=0 
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and 


(1.38) 


P{r  >  k  |  Hk.k}  =  1. 


For  0  <  j  <  k,  P{t  >  k  |  Hki])  is  just  the  probability  that  (k  —  j)  random  arcs  of  length 
are  disjoint  and  that  j  random  points  avoid  the  portion  of  the  circle  covered  by  the 
positive  length  arcs.  Upon  conditioning,  it  may  be  seen  that 


p{r>k\Hkj}  =  ( i-(k-j)^-  y( 


i-(k-j) 


m 


1  -  9.1  + 


1  -U--j)~^-j  ) 
+/ 


Noticing  that  this  formula  also  holds  for  the  cases  given  by  (1.37)  and  (1.38),  it  follows 
from  (1.36)  that 


P<r>k}  =  £ 

j=  o 


l-(t-j) 


m 


r  o 


1  -  q\ 

(which  is  the  same  as  (1.34)  for  this  case).  | 

Now  suppose  that  q  =  (1  —  m).  so  that 

0  w.p.  1  —  m 


f;(l  -q)k-J 


(1.39) 

Clearly,  for  k  >  2 


L,  = 


1  w.p.  m. 


P{t  >k}  =  P{Hk,k } 

=  (1  -  m)k. 


Hence 


£M  =  2  +  £(l-m)* 

k= 2 


=  m  +  — . 
m 


Putting  m  =  0.01  yields 


£(r]  =  100.01, 


which  is  larger  than  the  value  of  £[r]  for  each  of  the  previously  considered  distributions 
having  £[£,]  =  0.01. 


% 

v 
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Another  Interpretation 

Now  suppose  that  a  collision  is  considered  to  have  occurred  only  if  there  exists  an 
overlap  of  positive  length.  Then,  unlike  the  previous  interpretation,  the  arcs  of  length 
zero  cannot  participate  in  a  collision. 

The  distribution  of  r  is  easy  to  determine  in  this  case.  If  the  arc  lengths  are  dis¬ 
tributed  according  to  (1.35).  then  for  k  >  2 

k-2 


P{r  >k)  =  Y,P{r>k\  -  9)t_J 

o-*-' 


If  the  lengths  are  given  by  (1.39),  then  for  k  >  2 

P{r  >  k}  =  P{Hk.kuHk.k_,) 

=  (1  -  m)k  +  Jb(l  -  mf-'m. 

It  follows  that  in  this  case 


E[r]  = 2  +  y^[(  1  -  m)k  +  k(  1  -  m)t-1m] 
*  =  2 

_  2_ 
m 


Some  Bounds 


As  mentioned  previously,  the  evaluation  of  the  formulas  for  P{r  >  k }  becomes 
difficult  unless  k  is  rather  small.  It  is  therefore  convenient  to  establish  upper  and  lower 
bounds  for  these  probabilities. 

For  any  choice  of  F ,  (1.33)  provides  a  lower  bound  for  P{r  >  k}.  It  then  follows 

—  2/3 

from  the  proof  of  Proposition  1.11  that  for  k  <  mF 

P{r  >k}>  e~mF  -  m2F/3k), 


provided  that  mr  < 

An  upper  bound  for  P{t  >  kj  is  given  by  the  following  theorem 


yi 


yi 


y ,  y.  *.  . 
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Theorem  1 .8.  For  a  sequence  of  random  arcs  having  lengths  L\.  L?.  -  ■  •  1~d  F . 

(1.40)  P{t  >  k)  <  e-ik~l'zdF(  x)j 

for  any  choice  of  F. 

Proof.  If  0  <  Sk  <  1  then 

(1  -  Sk )+  =  (1  -  Sk) 

(1.41) 

<  e~Sk . 

This  inequality  is  also  true  for  5*  >  1  since  e~Sk  is  always  positive.  It  then  follows  from 
(1.41)  and  the  independence  of  the  L ,  that 

P{T  >  fc}  =  £{[d  -  St)+]*“’} 

<  f 

=  •—£-* )  j 

=  (y  e-<k-"*dFlx ))  •  I 

More  Examples 

The  upper  bound  ( 1.40)  is  generally  much  easier  to  evaluate  than  ( 1 .32).  For  instance, 
if 

P{L,  =  o)  =  P{L,  =  b}=1-. 
then  the  upper  bound  (1.40)  becomes 

(i.«,  (<-<»-■>• + ,  [Imh,  (,t .  „  (*z«))] 


It  may  be  shown  that  this  upper  bound  is  bounded  from  above  by  (1.42). 
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To  consider  a  specific  example,  suppose  that 

,,{il-SoH{I-  =  So}4 

Then  the  lower  bound  given  by  (1.33)  is 

P{t  >  it}  > 

so  that 


.(■-ss). 

OO 

£[r]  =  2  +  £/>{r  >  k } 
!(-*) 


k- 1 


*- 1 


>2  + 


=  9.622. 

Recalling  for  this  distribution  ( G4  from  before),  that  the  estimate  of  E'r]  obtained  from 
100.000  simulation  trials  was  9.694,  it  is  observed  that  the  lower  bound  is  reasonably  close. 


For  this  case,  the  upper  bound  given  by  (1.42)  becomes 
P{r  >  k}  < 


€XP  L_mi  4’exP  ['^j  \ 


Since  P{r  >  k}  =  0  for  k  >  199. 


r(  ,  ^  (exP  [~15o]  +  exP  \ 

*ws*  +  E - 5 — : - 1 


*=j 

=  9.945. 


This  upper  bound  for  £[r]  is  not  as  close  as  the  corresponding  lower  bound;  however,  it 
only  overestimates  by  about  2.6%. 

An  examination  of  the  case  where  £  is  the  uniform  distribution  over  [555,  again 
finds  that  the  bounds  produce  good  estimates.  The  upper  bound  (1.43)  yields 


Ws,+S( - — L) 


=  9.904. 

a  value  that  overestimates  the  observed  value.  9.636.  by  about  2.89? 
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The  lower  bound  in  this  case  has  the  same  value  as  did  the  lower  bound  in  the 
previous  case.  9.622.  This  occurs  because  the  lower  bound  provided  by  (1.33j  depends 
only  on  the  distribution  through  its  mean,  which  is  the  same  for  both  cases.  It  will  be 
seen  in  the  next  case  considered  that  the  lower  bound  sometimes  performs  very  poorly. 

If  the  arc  lengths  are  distributed  according  to  (1.39)  then 


so  that 


Putting  m  =  0.01  yields 


P{r  >  k)  <  [(1  -  m)  +  mr'1*-11)*, 


E[r]  <  2  +  ]T[(1  -  m)  +  mr|4-")‘. 


E[r]  <  100.025 


which  is  not  too  much  greater  than  the  exact  result. 

E[t]  =  100.01. 

Notice  however  that  the  corresponding  lower  bound,  which  is  again  equal  to  9.G22.  is 
extremely  poor  for  this  choice  of  F. 

Another  Approach 

An  alternative  upper  bound  for  P{t  >  Jr]  may  be  produced  by  the  method  used  in 
Section  2.  For  instance,  suppose  F  is  the  uniform  distribution  on  [a. 6],  and  let  p  denote 
P{Ar  A  A,}  (r  jL  s).  Then 

p  =  1  -  P{r  >  2}  =  b  +  a. 

P{Ar  A  A,.  A,  A  Au)  is  equal  to  p2  if  r,  s,  t  and  u  are  all  different.  If  s  =  u,  this  probability 


is  given  by 


— ' 1  - )3  J  J  J  P{A,  A  A,.  A,  A  A,  I  Lr  =  X,L,  =  y.L,  =  zjdidyd: 


13  ab 

=  —(a  +  6  2 - . 

12  1  3 


.  V  V  V 


\  v 


V  V. 
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which  is  less  than  | -|p2 
Now  for  it  >  3. 


P{r  >  k)  =  P  P|  {.4,  does  not  collide  with  4j} 

The  last  expression  above  is  less  than  or  equal  to  the  right  side  of  (1.7)  whenever 

(7*J-17*  +  3)  .  <*-  1)‘_.  . 


P  +  '  P  <  1. 

24 


Noting  that  the  left  side  of  1 1 .44  I  is  less  than 


7(*  -  l)2  (k  -  l)s  2 
12  P+  24  P  ‘ 


it  is  clear  that  ( 1  44)  is  true  for  all  k  <  1  +  min  j  y/Jp  j- 

The  key  steps  in  the  proof  of  part  (i)  of  the  following  proposition  have  now  been 
established.  The  remaining  details  of  the  proof  are  omitted.  Parts  (ii )  and  (lii )  are  proved 
in  the  same  way  as  the  analogous  propositions  in  Section  2. 

Proposition  1.14.  Suppose  F  is  the  uniform  distribution  on  [a.  6]  where  0  <  a  <  b  <  l. 


(i)  Let  M  =  min  j  (pSp)’”}.  Then 

Ptr  >*)<(  '-'"p-**'  for  2  <  t  < 
e~  4$~<a+!>)  otherwise. 


(ii)  For  fixed  t  >  0, 


P  |  \J ~~T  >  *  e“<J  as  (b  +  MIO. 


(iii)  For  0  <  a  < 


P{t  >  [(a  +  6)  °]}  —  exp 


--(a  +  b)'-2a 
2 


as  ( a  ■*-  6 )  i  0 
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Results  similar  to  these  may  be  obtained  for  the  discrete  circle  of  lengt !.  n. 


in  Section  3,  as  well  For  example,  suppose  F  is  the  distribution  that  assigns  equal 
probability  to  the  first  m  positive  integers,  where  rn  <  j .  Then 


1 


P{l,=j}=-  f*  =  1, 2.  ,  j  =  1. 

m 


•,ml, 


where  Lx  is  the  length  of  arc  At,  as  before.  The  clockwise  endpoints  c,.r2. 
independently  and  uniformly  from  5  =  {1,2. 

N  ow 


are  selected 


p=  P{Ar*A,}  = 

n 


P{A ,  A  A,.  A,  A  ,4„}  =  p‘ 

if  r.  s,  t  and  u  are  all  different,  and  for  r.  s.  and  f  all  different 


P{A.  a  A,.  A,  a  ,4S }  =  f '-P7  -  - - -"2 


13  , 
12; 


1  13 

12^  <  7?P* 


Noting  the  similarities  between  these  probabilities  and  the  analogous  ones  for  the  contin¬ 
uous  case,  it  follows  that  if  [a  *  6i  is  replaced  by  p  =  -J-.  then  the  results  stated  in  the  Iasi 
proposition  hold  for  the  discrete  case  also.  Similar  results  are  attainable  for  choices  of  F 
other  than  the  uniform  distribution,  in  both  the  discrete  and  continuous  cases.  However, 
not  everv  choice  of  F  can  be  successfullv  handled  in  this  manner. 


1.7.  Packing  Sequences  in  Two-Dimensional  Spaces 


In  Section  4  the  distribution  of  r  was  investigated  for  the  packing  sequence  of  placing 
random  arcs  of  length  a  on  a  circle  of  unit  circumference.  In  this  section,  several  two- 
dimensional  analogs  of  that  packing  sequence  will  be  examined. 

The  first  case  to  be  considered  will  be  that  of  placing  random  circular  disks  of  area  r 
on  a  two-dimensional  unit  torus  (see  Miles  [40]  for  a  description  of  the  fc-dimensional  unit 
torus).  Each  disk  has  radius  so  that  two  disks  collide  if  and  only  if  the  euclidean 
distance  between  their  centers  is  less  than  2^/^.  Formally 


S  =  (0,1]  x  (0,1] 


» 


31 


a 


u 


‘■■1 


S! 


t. 


£ 


V4 


V, 

V. 


$1 


V, 


.  -r  _  w*.  r 
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and 


The  points 


C.  =  (Ctl.Ct2)  (i  =  1,2,  •  ■•) 


are  the  centers  of  the  disks,  and  they  are  selected  randomly  from  S  by  letting  th-  C,}  be 
independent,  uniform  (0, 1]  random  variables.  The  distance  between  two  points  is  given 
by 

p(C,.C, )  =  ([p*(Cll.C,,)]2-M^Cl2.Cj2)]2),/2 


where 

'  1 1  -  y\ 
i  -  I*  -  yl 

This  packing  sequence  has  equivalent  points,  and 


p’ix,y)  = 


if  I*  -  yl  <  5 
otherwise. 


p  =  4  r . 


Unlike  the  case  for  arcs  on  a  circle,  the  distribution  of  r  is  not  known  precisely  in 
this  two-dimensional  extension.  Propositions  1.1  and  1.2  provide  lower  and  upper  bounds 
for  P{r  >  k }:  however,  the  upper  bound  is  not  sufficiently  tight  over  a  large  enough  range 
to  yield  an  asymptotic  expression  for  £[r]  as  p  tends  to  zero.  A  plausible,  but  not  exact, 
argument  will  be  used  below  to  derive  an  approximation  for  £[r]. 

For  each  disk  center  C,,  let  the  associated  “target  area”  be  given  by 
Ex  =  {(x,  y)  €  S  :  p((x,y),C\)  <  6}. 

Thus  a  disk  with  center  C3  will  collide  with  the  disk  having  center  C,  if  and  only  if  C,  6  £,. 
or  equivalently  if  and  only  if  C,  €  E,.  Also  let 

A,  =  {(x.y)  €  5  :  p((x,y).C,)  <  6/2). 


Then  .4,  represents  the  actual  disk  of  area  r  surrounding  the  point  C,. 


: 


i  -  a 
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Now  {r  >  k  -  1}  implies  that 

A,  n  Aj  =  ©  v  1  <  «  <  j  <  k\ 

however,  note  that  the  collection  of  sets  {  £, } * ” j1  need  not  be  mutually  disjoint.  The  event 
{r  <  Jt  -  1}  implies  that  .4,  n.4;  ^  ©  for  some  1  <  i  <  j  <  k.  This  means  that  for  at  least 
one  pair  (i.j).  the  area  of  E,  n  is  greater  than  the  maximum  allowable  overlap  of  target 
areas  when  all  of  the  A,  are  disjoint.  Now  it  is  heuristically  assumed  that  conditioned  on 
{t  <  k  -  1}  having  occurred,  the  area  of  1  £,  is  not  on  the  average  larger  than  it  is  for 
when  {r  >  Jt  -  1).  That  is.  it  does  not  seem  unreasonable  to  assume  that 


P{Ck  AC(*  -  1)  |  r  <  k  -  1}  <  P{Ck  AC(Jt  -  1)  |  r  >  Jt  -  1}. 


which  would  imply  that 


P{Ck/\C{k-  1)}  <  P{CkAC[k-l)[  r  >  k  —  1}. 


or  equivalent!} 


P{Ck  V  C(k  -  1 )  |  r  >  Jt  -  1}  <  P{Ck  v  Ck.x }. 


If  the  above  is  true,  it  then  follows  that 


P{t  >  Jt}  =  P{Ck  V  C(k  -  1 ).  r  >  it  -  1} 

=  P{Ck  vC(Jt-  1)  |  r  >  Jt  -  l}P{r  >  Jt  -  1} 
<  P{Ck  vC(k-  1  )}P{r  >  Jt  -  1}, 


and  induction  vields 


P{t  >k}<  P{CkvC(k-l)}P{Ck-X  vCfc_2}  -  P{C2vC,}P{r  >  1}. 


Since  P{t  >  1}  =  1  and 


P{Cm  vcm_,}  =  (1  -p)T 


(m  =  2.3.  ■  •), 


v;.v.\vv,v- 
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it  follows  that 


T‘ 


P{r  >  k)  <  (l  -  p)£-m»i 


=  (1  ~  P) 


, ,  f  fc — i  >3 

<  [d  -P)1/p]~p 


<  e 


If  this  upper  bound  is  indeed  true,  then  the  result  below  easily  follows  from  Theorem  1.2. 
Theorem  1.9.  Assume  (1.45).  Then  for  the  packing  sequence  of  circular  disks  on 


the  torus 


( 1.461 

or  equivalently 


£[7 


—  as  p  1  0. 
2  P 


£lrl  ~  “  t-i°- 


For  another  analog,  consider  the  random  placement  of  spherical  caps  having  surface 
area  v  on  a  sphere  of  unit  surface  area.  Each  cap  has  angular  radius  a  =  2  sin-1  ( v^)- 
and  two  caps  collide  whenever  the  great  circle  distance  between  their  centers  is  less  than 
f  =  2a.  This  packing  sequence  has  equivalent  points  with  p  =  4r(l  -  r).  Similar  to  the 
previous  analog,  it  follows  that  for  small  p  an  approximation  of  E[t]  is  given  by 


Thus  for  r  not  too  large. 

£M  *  ]/sv(l-v )‘ 

It  is  interesting  to  compare  the  above  result  with  the  analogous  one  for  random  disks 
on  the  unit  torus.  For  disks  and  spherical  caps  having  the  same  area,  the  results  imply 
that  the  expected  proportion  of  the  total  surface  area  covered  immediately  prior  to  the 
initial  collision  is  greater  for  the  caps  on  the  sphere  than  it  is  for  the  disks  on  the  torus. 
Since  P{r  >  2}  =  1  -  p  for  packing  sequences  having  equivalent  points,  it  also  follows 
that  P{t  >  2}  is  greater  for  spherical  caps  than  for  disks  of  the  same  area.  Because  it 
is  difficult  to  obtain  closed  form  expressions  for  P{r  >  for  caps  on  the  sphere  when 
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k  >  3.  it  will  not  be  proven  that  P{r  >  k}  is  always  greater  for  spherical  caps  than 
for  corresponding  disks:  however,  simulation  results  (not  reported  here)  indicate  that  this 
may  be  the  case. 

A  third  two-dimensional  analog  to  arcs  on  a  circle  is  the  placement  of  squares  of  area 
v  on  a  two-dimensional  torus  such  that  the  two  sides  of  each  square  are  aligned  to  be 
parallel  to  a  pair  of  perpendicular  axes  on  the  torus. 

For  this  extension,  the  space  5  and  the  centers  Ci.Cj,--  are  exactly  the  same  as 
thev  were  for  the  case  of  disks  on  t lie  torus.  However,  now 


n(Cx,Cj)  =  max{p*(Ctl.Cji ).  p’(C,2.  C]2 )} 


where  p*  is  the  same  as  before,  and 


f1  =  y/t\ 


This  packing  sequence  has  equivalent  points,  and  it  seems  reasonable  to  suppose  that. 


for  small  p. 


as  it  was  for  the  case  of  disks  on  the  torus.  For  squares  of  area  v. 


P  =  4v, 


so  that 


£M  *  t/s- 


Thus  the  expected  time  to  the  first  collision  for  random  squares  of  area  v  on  a  unit  torus 
appears  to  be  approximately  equal  to  the  expected  time  to  the  first  collision  for  random 
disks  of  area  r  on  a  unit  torus. 

The  following  table  displays,  for  both  settings,  some  estimates  of  E\t)  obtained  by 
computer  simulation,  as  well  as  the  corresponding  values  of  the  approximation  formula 
for  £[r],  E[tj -t]  denotes  the  observed  average  value  of  r  for  disks  of  area  t>,  and  £[r4>l] 
denotes  the  corresponding  value  for  squares  of  area  r. 
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A  Monte  Carlo  study  of  expected  time  to  first  collision  on  the  torus 

rdl  is  the  collision  time  for  random  disks  of  area  v 
r3fl.  is  the  collision  time  for  random  squares  of  area  v 


t’ 

#  trials 

E[rd_v) 

v/£ 

1 

16 

60000 

3.3255 

3.3282 

2.5066 

1 

25 

50000 

3.9517 

3.9570 

3.1333 

1 

36 

50000 

4.5833 

4.5862 

3.7599 

1 

64 

50000 

5.8134 

5.8334 

5.0133 

1 

81 

50000 

6.4412 

6.4551 

5.6399 

1 

100 

110000 

7.0696 

7.0753 

6.2666 

6.4  x  10“3 

50000 

8.6662 

8.6608 

7.8832 

2.5  x  10-3 

50000 

13.345 

13.335 

12.533 

1.6  x  10~3 

50000 

16.525 

16.515 

15.666 

9.0  x  10~4 

50000 

21.697 

21.713 

20.889 

6.25  x  10~4 

50000 

25.947 

25.822 

25.066 

4.0  x  10-* 

10000 

32.193 

32.110 

31.333 

1.0  x  lO'4 

10000 

63.440 

62.725 

62.666 

2.5  x  10“5 

10000 

126.90 

125.66 

125.33 

1.6  x  10'5 

10000 

158.63 

157.80 

156.66 

9.0  X  10~6 

10000 

209.35 

207.71 

208.89 

6.25  x  10~6 

10000 

251.95 

252.90 

250.96 

4.0  x  10"6 

10000 

313.93 

312.45 

313.33 

1.0  x  10~6 

5000 

631.50 

627.10 

626.66 

It  may  be  seen  above  that  for  the  larger  values  of  v,  E[rJ-v]  is  greater  than  E\rd <t.]. 


For  the  smaller  values  of  r  considered,  the  reverse  is  true  except  in  two  cases.  For  these 
two  cases,  the  apparent  discrepancy  may  possibly  be  due  to  the  smallness  of  the  samples 
and  the  variability  of  r. 
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It  is  interesting  to  note  that  for  ail  v  <  the  distribution  of  r  is  different  for  disks 
and  squares  of  the  same  area  r.  This  may  be  established  from  the  fact  that 

P{r  >  3}  =  1  -  12t>  +  ^32  +  J  v2 

for  disks  of  area  v  <  £,  and 

P{t  >  3}  =  1  -  12r  +  39f2 

for  squares  of  area  v  <  These  probabilities  were  calculated  explicitly. 

Packing  sequences  may  also  be  defined  in  spaces  having  dimension  greater  than  two. 
For  example,  consider  the  three-dimensional  extension  of  the  last  packing  sequence  de¬ 
scribed.  Such  a  packing  sequence  corresponds  to  placing  at  random  cubes  of  volume  r  in 
a  three-dimensional  unit  torus.  A  collision  occurs  whenever  two  cubes  overlap. 


For  r  < 


P{t  >2}  =  l-p=l-  8r. 


A  direct  calculation  yields  that  if  t>  <  ^  then 

P{r  >  3}  =  1  -  24 r  +  165r2. 

The  computation  of  P{t  >  k }  for  k  greater  than  3  is  not  easily  accomplished;  however,  it 
is  not  unreasonable  to  expect  that  if  v  is  sufficiently  small  then  P{r  >  k }  is  a  polynomial 
in  v  of  degree  k  —  1  having 


1  -  k{k  -  l)4r  =  1  -  Kip 


as  its  first  terms. 


An  argument  similar  to  the  one  given  for  the  two-dimensional  cases  suggests  that 
E{t]  should  be  closely  approximated  by 


V  2 p  V  16u 

To  be  more  precise,  if  (1.45)  is  true  then  it  easily  follows  that  £[r]  ~  y/lif- 


v 
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The  table  below  contains  estimates  of  E[t]  obtained  by  computer  simulation,  along 
with  corresponding  values  obtained  from  the  approximation  formula  above. 


V 

#  trials 

E[r) 

1 

64 

104 

4.372 

3.545 

1 

512 

104 

10.027 

104 

14.829 

104 

39.633 

1 

512000 

103 

311.652 

317.066 

1 

lb* 

103 

452.472 

443.113 

1 

(8xl06) 

200 

1229.625 

1253.314 

1.8.  Random  9-ary  Codewords 

Consider  the  metric  space  consisting  of  all  9-ary  n-tuples  with  Hamming  distance  as 
the  metric.  That  is.  let 


S  —  1 }  0  —  1.  '  *  * '  tl)} . 


and  for  1  =  (xj,-  ■  -  ,in)  €  5  and  y  =  (j/i ,  •  •  • ,  yn)  e  S  let 

n 

pi(x,y)  =  n  - 

1=1 


where 


if  w  =  0 


«o  =  \ 

[0  if  w  ^  0. 

Thus  the  Hamming  distance  between  two  points  is  just  the  number  of  coordinates  in  which 
they  differ. 

Note  that  there  are  9"  points  in  the  space.  If  two  points  collide  whenever  the  Ham¬ 
ming  distance  between  them  is  less  than  d  €  {1,2,-  -  -} ,  then  the  probability  that  two 
arbitrary  points  collide  is  just 


?v  >'  V  V  '• 

ft 


‘ •  '■  ■1,«v '•"*  V"*>  ’  f  V  W.V."/.V.V,V4  A  w.  IT.  r,  \r.  *  K_*  <- HTH."VV]| 
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Note  that  for  d  equal  to  1.  the  setting  is  the  same  as  that  of  the  basic  birthday  problem 
when  there  are  q n  days  in  a  year. 


For  the  case  of  q  equal  to  2  and  d  >  1  fixed,  a  result  of  Kozlov  [32]  yields  that  as  n 
and  k  tend  to  infinity,  if 


d)  —  A 


then 


P{t  >  *}  -  «-A. 


This  suggests  that  the  approximation 


P{t  >  k)  SS  e~C3)P(2.n.d) 


should  not  be  too  bad  if  n  and  k  are  not  too  small. 


The  results  of  Section  2  may  also  be  applied  in  this  setting,  since  the  packing  sequence 
has  equivalent  points.  Proposition  1.3  yields  that 


P{q,ri,d)\l/ 2 


r  >  t  }  ~  e" 


as  n  —  oc. 


for  fixed  q  and  d.  Furthermore,  an  argument  similar  to  the  one  preceding  ( 1.45).  combined 
with  the  result  of  Theorem  1.1,  suggests  that  the  approximation 


(1.47) 


£['|sl/S; 


d) 


should  do  reasonably  well  provided  that  n  is  not  too  small. 


As  an  example,  consider  the  case  where  q  =  2  and  d  =  3.  Letting  E[r}m  denote  the 
average  value  of  r  obtained  from  m  random  packing  sequences  performed  by  a  computer 
using  a  pseudo  random  number  generator,  the  table  below  displays  values  of  E[r]m  and 
the  approximation  given  by  (1.47)  for  various  values  of  n. 


I 
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n 

m 

\J  2p(2.r>.3) 

9 

1000 

4.96 

4.18 

11 

1000 

7.73 

6.93 

13 

1000 

12.52 

11.83 

15 

500 

21.70 

20.62 

17 

500 

36.92 

36.56 

19 

500 

66.51 

65.66 

21 

250 

119.92 

119.16 

23 

250 

217.41 

218.10 

iws  some  cor 

responding  results  for 

n 

m 

E[r)m 

2p(2.n,5) 

15 

500 

6.08 

5.15 

16 

500 

7.00 

6.40 

17 

500 

8.45 

8.00 

18 

500 

10.89 

10.09 

19 

500 

14.08 

12.79 

20 

250 

17.16 

16.30 

21 

250 

21.68 

20.89 

22 

250 

27.06 

26.89 

23 

250 

34.56 

34.76 

24 

250 

47.86 

45.11 

Now  consider  an  alternative  packing  sequence  on  the  space  of  ternary  n-tuples  having 
a  metric  p.  given  by 

H(x,y)  =  max  \xt  -  j/,|. 

l<t<n 

Two  random  points  collide  whenever  the  distance  between  them  is  less  than  2.  Thus  two 
points  C,  and  Cj  do  not  collide  if  and  only  if  for  some  m,  C,m  =  0  and  C;m  =  2.  or 


mm 


GW 


v.y.v.v.y 
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Ctm  -  2  and  Cjm  =  0. 

This  packing  sequence  may  also  be  described  as  follows.  Imagine  a  cube  in  n  di¬ 
mensions  of  sidelength  4,  having  a  cubic  lattice  of  unit  sidelength  superimposed  within 
it  so  that  each  vertex  of  the  cube  coincides  with  a  lattice  point.  Smaller  cubes,  having 
sidelength  2,  are  then  sequentially  placed  at  random  within  the  larger  cube  so  that  each 
vertex  coincides  with  one  of  the  lattice  points.  This  may  be  accomplished  by  letting  the 
C,}  (t  =  1,2,  •  •  •,  j  =  1,  •  •  -  ,n)  be  i.i.d.  random  variables  having  a  uniform  distribution  on 
the  set  {0, 1,2}.  Equivalently,  each  of  the  small  cubes  is  put  uniformly  at  random  at  one 
of  3r‘  possible  locations  within  the  big  cube. 

The  centermost  location  within  the  large  cube  is  labeled  (1,1,-  ■•,!)•  The  sides  of  a 
small  cube  placed  at  this  location  do  not  touch  any  of  the  sides  of  the  large  cube.  This  is 
not  true  if  a  small  cube  is  placed  at  any  other  location. 


A  collision  occurs  whenever  any  two  small  cubes  are  not  disjoint.  Notice  that  a  small 
cube  placed  at  (1.  •  •  •.  1)  will  collide  with  any  other  cube. 

The  probability  that  any  two  arbitrary  points  collide  is 


This  follows  from  the  fact  that 


P{\Clm  -CJm\  <2} 

=  P{1  ctm  -  cjm I  <  2  I  Cjm  =  1}  Q)  +  P{\Ctm  -  C]m I  <  2  |  cjm  e  {0.2}} 


9' 


The  packing  sequence  does  not  have  equivalent  points  since 


» 


P{C,  AC,  I  Cj  =  (1,  •••,!)}  =  l#p. 


Thus  Theorem  1.1,  which  provides  a  lower  bound  for  E[t ]  which  is  asymptotically  equiv¬ 
alent  to  is  not  applicable.  The  argument  in  Section  7  which  suggests  that  E{t)  is 
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bounded  from  above  by  is  also  not  applicable. 


In  order  to  see  whether  or  not  \JTp  provides  a  good  approximation  for  £[r]  for 
this  packing  sequence,  random  packings  were  generated  by  computer  for  various  values  of 
n.  The  table  below  displays  results  obtained  from  these  packing  attempts,  along  with  the 
corresponding  values  of  The  notation  is  the  same  a  before.  Estimates  of  the  standard 
deviation  of  £[r]m  are  also  given  since  r  has  great  variability  in  this  packing  scheme. 
Although  it  is  not  conclusive,  the  results  indicate  that,  for  small  p.  approximates 
£[rl  reasonablv  well  even  in  this  unusual  case. 


n 

m 

(?r,1v/5 

£[r]m 

s  d.  (£(rjm) 

5 

10000 

2.35 

3.37 

0.01 

10 

10000 

4.40 

5.66 

0.03 

15 

5000 

8.25 

9.80 

0.07 

20 

2000 

15.47 

17.06 

0.20 

25 

1000 

29.00 

32.00 

0.53 

30 

1000 

54.35 

57.26 

0.96 

35 

500 

101.88 

106.50 

2.58 

40 

100 

190.96 

188.69 

9.59 

45 

100 

357.94 

365.94 

17.20 

50 

100 

670.92 

638.79 

32.75 

The  table  below  compares  the  distribution  of  r  for  packing  sequences  of  arcs  and 
simple  cubes  for  which  p  is  the  same.  The  results  for  the  arcs  on  the  circle  are  exact,  while 
the  simple  cubic  results  are  estimated  from  the  simulation  trials. 
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E[r]  s.d.(r) 


p 

41 

arcs 

cubes 

arcs 

cubes 

6.56  x  10"3 

15.47 

16.23 

17.06 

7.84 

9.04 

1.87  x  10"3 

29.00 

29.75 

32.00 

14.91 

16.67 

5.32  x  lCT4 

54.35 

55.11 

57.26 

28.17 

30.40 

It  may  be  seen  above  that  yj  ^  underestimates  E[t\  in  all  cases,  but  does  so  more 
severely  for  the  cubes  than  for  the  arcs.  Also  notice  that  the  standard  deviation  of  r  is 
greater  for  the  cubes  than  it  is  for  the  arcs. 

1.9.  Summary 

It  has  been  shown  that  the  approximation 

(1.4S)  F.[t)* 

holds  in  a  wide  variety  of  collision  settings.  For  cases  considered  for  which  the  distribution 
of  r  is  known  exactly,  the  two  sides  of  (1.48)  are  proven  to  be  asymptotically  equivalent. 
For  packing  sequences  of  equivalent  points,  asymptotic  equivalence  is  made  plausible  but 
has  not  been  proven.  That  (1.48)  can  hold  for  a  packing  sequence  not  possessing  equivalent 
points  was  demonstrated  by  the  simple  cubic  packing  scheme.  The  formula  (1.48)  does 
not  hold  for  arcs  of  unequal  or  variable  length  on  the  circle:  however,  note  that  in  these 
settings  a  collision  is  no  longer  defined  by  the  simple  relation 

C,  A  C;  o  n(C,,C])  <  6 

for  some  fixed  <5. 

The  approximation  formula 

(1.49)  P{t  >  k}  «  e~^p 

has  also  been  investigated.  For  random  arcs  of  constant  length.  (1.49)  is  reasonably 
accurate  whenever  k  <  p~2/3.  This  is  suggested  by  Proposition  1.12  and  demonstrated  by 


fljMMgaasa&aaaa^ 


>.v.v.v.v.v:>  v.v.v.v.v.vtv.' 
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the  results  shown  in  Table  A  The  results  of  Table  A  also  indicate  that  ( 1 .49  i  can  be  quite 
inaccurate  if  k  >  p~2/3.  It  is  reasonable  to  suppose  that  the  cutoff  point  occurs  in 

other  packing  schemes  as  well 


Chapter  2 


Approximate  Packing  Densities  of 
Randomly  Constructed  Codes 


2.1.  Introduction 

Consider  the  sequence  of  points  C\ ,  C2,  •  •  •  chosen  independently  and  uniformly  from 
5.  Each  point  in  the  sequence  will  be  considered  to  be  either  packed  or  rejected.  A  point 
will  be  rejected  if  and  only  if  it  collides  with  a  previously  packed  point.  Otherwise  the 
point  will  be  packed.  Thus  Cj  will  be  packed,  and  C2  will  be  packed  unless  it  collides 
with  Ci.  Then,  letting  P ,  denote  the  union  of  all  points  from  among  which 

have  been  packed.  C,+i  will  be  packed  unless  it  collides  with  some  member  of  P,. 

The  members  of  P,  are  said  to  constitute  a  saturated  packing  if  each  point  in  5 
collides  with  at  least  one  element  of  P,.  Thus,  if  P,  is  a  saturated  packing,  none  of 
C,+i .  C,+2 .  •  •  •  can  be  packed  and  P,  =  P,+ 1  =  P.+2  =  •  •  •• 

Let  T  be  defined  as  the  first  time  i  for  which  P,  is  a  saturated  packing.  Call  P,  a 
stopping  time,  the  time  to  saturation  for  the  packing  sequence.  Let  M  be  the  random 


variable  defined  bv 


M  =  #Pr- 


Thus  M  counts  the  number  of  points  in  a  saturated  packing,  and  T  is  the  total  number 
of  selections  from  5,  including  rejections,  needed  to  reach  saturation. 

For  a  metric  space  (S. p),  suppose  a  suitably  defined  content  v  is  associated  with 
each  point.  Then  let  the  random  packing  density,  denoted  by  p ,  be  defined  as 


p  =  r  E[M). 


r 

Vi 

rjt 
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If  5  is  finite,  the  content  v  may  be  taken  to  be  (#S)_l.  For  this  choice  of  t\  the  packing 
density  will  be  referred  to  as  a  center  density  and  will  be  denoted  by  r.  Hence 


'  ■ 


Random  packing  problems  have  been  studied  by  numerous  investigators;  however, 
attempts  to  obtain  exact  solutions  have  been  met  with  very  little  success  except  for  the 
cases  of  packing  on  the  discrete  and  continuous  circles  (or  line  segments).  The  one- 
dimensional  variations  are  often  referred  to  as  “parking  problems"  since  the  general  idea 
can  be  expressed  by  the  question  ‘How  many  cars  of  length  a  can  eventually  be  parked  on 
a  street  of  length  x  if  the  parking  is  done  at  random?'.  Put  this  way,  it  is  also  assumed 
that  parked  cars  are  never  moved  and  that  cars  will  move  on  to  other  streets  if  they  cannot 
fit  next  to  the  curb  at  their  randomly  chosen  locations. 

Random  packing  problems  in  spaces  of  dimensions  two  and  three  are  also  of  inter¬ 
est,  partially  because  they  can  be  used  to  model  physical  phenomena  such  as  molecular 
adsorption  and  liquid  and  plasma  structure.  It  is  also  of  interest  to  compare  the  packing 
densities  obtained  with  the  density  from  an  analogous  one-dimensional  case.  A  conjecturp 
of  Palasti  [44]  suggests  that  a  two-dimensional  density  should  be  equal  to  the  associated 
one-dimensional  density  squared;  however,  the  conjecture  has  never  been  proven  and  nu¬ 
merous  simulation  studies  have  indicated  that  it  is  not  true.  More  will  be  said  about  this 
conjecture  in  Section  2.5. 

Solomon  [53]  reviews  the  major  findings  in  the  one-dimensional  settings,  and  discusses 
efforts  to  approximate  densities  in  higher  dimensions  by  simulation  results.  A  more  recent 
survey  article  by  Solomon  and  Weiner  [55]  updates  this  material. 

The  study  of  random  packing  densities  in  multidimensional  spaces  through  computer 
generated  packings  is,  in  general,  difficult  due  to  the  extremely  long  running  times  required 
to  perform  the  random  packings.  However,  if  each  coordinate  of  a  random  point  must 
assume  one  of  only  a  small  number  of  possible  values,  then  the  time  required  to  achieve  a 
saturated  packing  need  not  be  unreasonably  long.  Random  packing  sequences  on  the  space 


r ,  r.  »  .  r. 
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of  q- ary  codewords  provide  convenient  and  interesting  ways  to  study  packing  densities  in 
spaces  of  two  or  more  dimensions.  Various  random  coding  schemes  will  be  examined  in 
the  next  five  sections. 

2.2.  Random  Binary  Codes 

Consider  the  metric  space  introduced  in  Section  8  of  Chapter  1.  and  let  q  equal  2. 
Thus  5  is  the  space  of  all  binary  codewords  of  length  n.  There  are  2"  such  codewords, 
and  each  one  can  be  represented  by  a  unique  r?-component  vector  with  each  component 
being  either  0  or  1. 

The  Hamming  weight  of  a  codeword  u  is  defined  to  be  the  number  of  nonzero  com¬ 
ponents  of  u,  and  the  Hamming  distance  between  two  codewords  u  and  v  is  the  Hamming 
weight  of  u  -  v,  where  modulo-2  arithmetic  is  applied.  Note  that  this  definition  of  Ham¬ 
ming  distance  is  equivalent  to  the  one  given  in  Chapter  1. 

Subsets  of  5  containing  two  or  more  elements  will  be  called  codes.  A  code  is  called 
an  (n.d)- code  if  the  codewords  are  of  length  n  and  the  distance  between  each  pair  of 
words  is  greater  than  or  equal  to  d.  The  minimum  distance  d  is  an  important  parameter 
in  the  description  of  a  code  since  it  is  related  to  the  error  detecting  and  error  correcting 
capabilities  of  the  code.  It  is  possible  to  detect  up  to  d  -  1  errors,  where  an  error  is 
said  to  occur  when  a  bit  is  recorded  incorrectly  at  the  receiving  end.  Furthermore,  up  to 
l(d-  l)/2]  transmission  errors  can  always  be  successfully  corrected. 

Usually  codes  are  constructed  by  algebraic  methods  in  order  to  “neatly  arrange" 
the  codewords,  and  hopefully  produce  (n,  decodes  which  have  the  maximum  number  of 
words.  A(n.d)  will  denote  the  number  of  words  in  the  largest  possible  (n,d)-code.  Codes 
constructed  algebraically  may  also  have  nice  properties  which  make  decoding  and  error 
correcting  easier.  Despite  a  large  literature,  the  known  “good"  codes  are  relatively  few  in 
number. 

The  problem  investigated  in  this  section  is  the  random  generation  of  various  types 
of  binary  error-correcting  codes.  While  the  primary  moti\ation  for  this  work  was  to  learn 
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something  about  random  packing  densities  in  high-dimensional  spaces,  it  is  also  of  interest 
to  see  how  the  sizes  of  random  codes  compare  with  the  sizes  of  similar  codes  constructed 
deterministically. 

Itoh  and  Solomon  [27]  consider  the  random  sequential  construction  of  binary  codes 
for  various  values  of  n  and  d.  They  begin  the  construction  process  by  selecting  at  ran¬ 
dom  a  single  codeword  from  5.  Then  the  second  word  is  chosen  at  random  from  among 
all  codewords  which  are  at  Hamming  distance  d  or  greater  from  the  initial  word.  This 
procedure  is  continued,  at  each  step  choosing  a  new  codeword  from  the  collection  of  all 
codewords  which  are  at  a  Hamming  distance  of  d  or  greater  from  all  of  the  codewords 
previously  selected.  The  process  terminates  when  it  is  no  longer  possible  to  add  another 
codeword  to  the  chosen  set.  This  procedure  corresponds  to  the  sequential  packing  scheme 
described  in  Section  1.  provided  that  a  pair  of  codewords  are  considered  to  collide  when¬ 
ever  the  Hamming  distance  between  them  is  less  than  d.  Thus  the  random  (n.d)-code 
formed  is  just  a  saturated  packing  of  codewords. 

Letting  M(n.d)  denote  the  number  of  words  contained  in  a  saturated  packing,  the 


center  densitv  is  denoted 


rd,n  =  2-"£[M(n,d)]. 


This  densitv  tnav  be  estimated  bv 


rd,n  =  2~n  M  (n,d). 


where  M{n,d)  is  the  average  number  of  words  packed  in  a  Monte  Carlo  experiment.  That 
is,  if  N  random  packing  attempts  result  in  codes  being  formed  having  A’i,  A'2,  •  •  • ,  AV 
words,  then  M{n,d)  =  A",. 

The  results  of  numerous  packing  attempts  performed  by  computer  are  summarized 
in  Appendix  A.  The  data  for  binary  cases  with  3  <  n  <  17  are  taken  from  [27].  The  data 
for  cases  with  n  >  17  are  results  from  more  recent  simulations.  It  should  be  noted  that 
the  number  of  trials  per  case  ranges  from  10.000  for  cases  having  n  <  10  to  only  10  trials 
for  most  cases  having  n  >  17. 


rvr^ r-tr.  A  A*’,  *r.  <-VV  A-’,  A  A/.  **,.•* 
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For  example,  consider  the  first  entry  in  Appendix  A.  This  says  q  =  2  (a  binary 
problem],  n  =  3  (so  binary  triples  are  being  considered),  and  d  =  2  (so  any  two  words  in 
the  packing  must  differ  in  at  least  two  coordinates).  The  Monte  Carlo  estimates  for  the 
mean  and  the  standard  deviation  are  about  3.49  and  0.87.  Unfortunately,  large  standard 
deviations  are  an  inherent  part  of  the  problem  in  this  case  as  well  as  in  most  of  the  other 
cases  considered. 

Itoh  and  Solomon  propose  that  the  form  n~'"‘  approximates  the  rj  n  reasonably  well 
for  the  cases  d  —  2  and  d  =  3.  where  the  -vd  are  constants.  They  estimate  y?  and  *>3  from 
logarithmic  plots  of  the  simulation  results  using  the  least  squares  method.  Only  the  cases 
for  which  10  <  n  <  17  are  used  in  the  fit. 

The  following  tables  compare  their  fitted  values  with  the  simulation  results.  Below, 
and  throughout  the  chapter,  s.d.  will  be  used  to  denote  the  estimated  standard  deviation 
obtained  from  the  sample. 

d  =  2  cases 
72=0.6249 


n 

h.n 

„-o2  (7,-02 

-  r2.n)/ 

8 

0.26914 

0.27269 

12.7 

9 

0.25215 

0.25334 

6.1 

10 

0.23656 

0.23720 

5.0 

11 

0.22324 

0.22349 

0.9 

12 

0.21179 

0.21166 

-0.7 

13 

0.20187 

0.20133 

-1.4 

14 

0.19257 

0.19222 

-1.5 

15 

0.18399 

0.18411 

0.7 

16 

0.17677 

0.17683 

0.5 

17 

0.17018 

0.17026 

0.3 

WWW# 


v.y.vy.v; 


76  Chapter  2  Approximate  Packing  Densities  of  Randomly  Constructed  Codes 


d  =  3  cases 
73=1.319 


72 

^3,n 

n~°3 

(n  -  r3 n)/s. 

8 

0.06430 

0.06435 

1.5 

9 

0.05543 

0.05509 

-13.3 

10 

0.04834 

0.04794 

-23.2 

11 

0.04263 

0.04228 

-9.9 

12 

0.03800 

0.03769 

-12.9 

13 

0.03410 

0.03392 

-3.1 

14 

0.03077 

0.03076 

-0.3 

15 

0.02802 

0.02808 

2.9 

1G 

0.02557 

0.02579 

14.1 

17 

0.02346 

0.02381 

8.5 

It  car  be  seen  that  the  fit  is  rather  good  for  d  =  2  and  n  >  11;  however,  the  model  does 
not  provide  a  very  good  fit  for  the  case  of  a  =  3. 

In  Chapter  1  it  was  shown  that  the  expected  value  of  the  stopping  time  r  could 
be  approximated  by  a  simple  function  of  p.  the  probability  that  two  arbitrary  points 
collide.  Similarly,  for  binary  codeword  packing,  the  center  densities  rdn  can  be  closely 
approximated  by  a  function  of  the  ratio  ^  provided  that  n  is  large  enough.  For  the  space 
of  binary  codewords  of  length  n,  the  content  t-  is  denoted  by  vn  and 

,n  =  (#S)-1  =  2-n. 

Let  pin  be  the  probability  that  two  randomly  selected  codewords  of  length  n  are 
separated  by  a  Hamming  distance  less  than  d.  Then 


"w 
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Letting  8d.n  denote  the  ratio  Pd.nh'd.n-  it  follows  that 

d- 1  /  v 


It  is  found  that 

fdWd.n)  =  Od0n  + 

approximates  the  observed  f^n  rather  well  for  n  not  too  small,  where  the  qj  and  the  *3d 
are  constants.  The  values  of  these  constants  are  determined  by  performing  a  weighted 
nonlinear  least  squares  fit  on  each  set  of  points  rdtHd)- ^d,L{d)+ 1<  •  •  • ,  fd.U(d)  (d  =  1,  •  •  • .  10) 
using  a  modified  Gauss-Newton  algorithm  (BMDP  program  3R  was  used).  Each  case  was 
weighted  by  the  inverse  of  the  estimated  variance  of  fd,n-  The  L(d)  were  determined  by 
trial  and  error,  and  were  chosen  to  be  as  small  as  possible  while  keeping  the  resulting  fit  ac¬ 
ceptably  accurate.  The  U(d)  are  chosen  to  be  as  large  as  possible,  the  values  being  limited 
only  by  the  enormous  amount  of  computer  time  required  to  perform  the  simulations. 

The  following  table  summarizes  the  parameters  resulting  from  the  curve  fittings. 

Parameters  determined  for  binarv  cases 


d 

L(d) 

U(d) 

Od 

0  4 

2 

9 

17 

0.331 

0.630 

3 

12 

17 

-0.870 

0.740 

4 

10 

17 

-3.20 

0.736 

5 

12 

18 

-22.1 

0.795 

6 

11 

18 

-55.6 

0.795 

7 

15 

18 

-2.04  x  102 

0.833 

8 

14 

18 

-7.60  X  102 

0.831 

9 

17 

19 

-4.48  X  103 

0.851 

10 

17 

20 

-7.68  x  103 

0.861 

The  entries  in  the  next  table  are  the  residuals,  fdi^d.n)  ~  ^d. n-  divided  by  the  estimated 
standard  deviation  of  fd 


T 


J*  h 


f 

? 
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Standardized  residuals 


0.630  for  a2  and  02  respectively.  The  standardized  residuals  from  this  fit  are  given  in  the 
first  column  of  the  second  table.  For  instance,  the  n  =  11  entry  shows  that  /2(02in)  under 
approximates  r211  by  0.3  s.d.(f2  n).  Here 

f2(02M)  =  9^1  +026^. 

Note  from  the  first  table  above  that  #3  %  04,  05  v  06,  07  ss  0s,  and  0 9  %  0IO. 
Further  note  that  if  B(d)  =  f^"|,  then  0^^  is  approximately  constant.  This  fact  will 
be  discussed  further. 

If  the  r^  n  are  fit  with  the  approximation 

=  *5+aA-“*!. 


n 

<M 

II 

T3 

d  =  3 

d  =  4 

d  =  5 

d  =  6 

d=  7 

d  =  8 

d  =  9 

O  , 

H 

II 

tJ 

9 

0.8 

11.9 

-6.0 

-29.2 

- 

- 

- 

- 

10 

0.3 

11.3 

-0.1 

-37.5 

-39.3 

- 

- 

- 

- 

11 

-0.3 

3.0 

0.8 

-8.7 

0.1 

-65.3 

- 

- 

12 

-1.6 

0.3 

-0.8 

-0.3 

-0.2 

42.2 

-94.8 

- 

_ 

13 

-1.6 

-0.4 

0.6 

0.9 

-0.2 

6.4 

-46.7 

- 

- 

14 

-1.5 

0.5 

0.2 

0.1 

0.7 

4.7 

-0.5 

- 

1 

15 

0.9 

-1.9 

-0.8 

0.4 

0.0 

0.0 

0.9 

- 

j 

16 

0.9 

0.6 

0.4 

0.1 

0.2 

-0.2 

0.1 

-16.2 

-52.7  : 

17 

0.4 

1.2 

0.6 

-1.4 

0.3 

1.3 

-0.4 

-0.9 

-0.3  3 

; 

18 

- 

- 

- 

0.4 

-1.6 

-0.7 

-0.8 

1.1 

N 

-0.1 

h 

19 

- 

- 

- 

- 

- 

- 

- 

-0.6 

* 

2.1  f 

20 

- 

- 

- 

- 

- 

- 

- 

- 

-1.3  i 

r 

For 

example,  consider  the  < 

d  =  2 

entry  of 

the  first 

table. 

This  shows 

that  the  fit  !' 

done 

using 

^2.9.  ^2,10 

>•  •  -,h,\ 

7,  and 

that  these  points  led  to  the  estimates 

of  0.331  and  ‘ 

1 
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the  estimated  ad  remain  the  same  as  before.  The  estimate  of  j3  gotten  from  each  fit 
is  shown  below,  along  with  corresponding  estimates  of  the  standard  deviation  of  3.  The 
estimated  standard  deviations  are  computed  with  a  provisional  means  algorithm  (see  [10]). 


0.3975 


0.4059 


0.3990 


0.39S5 


0.3995 


0.4022 


0.3962 


0.3800 


0.4074 


s.d.  (0) 
0.0009 
0.0006 
0.0009 
0.0004 
0.0013 
0.0030 
0.0038 
0.0126 
0.0148 


These  results  suggest  that  perhaps 


Td,rx  ~  &dn  as  n  —  oc, 

for  some  3  approximately  equal  to  0.40.  This  asymptotic  form  has  the  ratio  rdn+l/rdri 
tending  to  1  as  n  tends  to  infinity,  in  agreement  with  an  observation  made  by  Itoh  and 
Solomon  [27],  As  with  other  packing  settings,  there  does  not  seem  to  be  a  simple  heuristic 
argument  which  suggests  the  limiting  form  of  the  densities. 

A  Comparison  with  the  Hamming  Bound. 

The  Hamming,  or  sphere-packing,  bound  for  codes  guarantees  that  any  t-error- 
correcting  binary  code  of  length  n  containing  M  codewords  satisfy 


M  1  + 


(see  [38]).  Since  the  number  of  bit  errors  an  (n,  d)-binarv  code  can  successfully  correct  is 


[(rf-l)/2)]. 
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the  bound  implies  that  the  number  of  words  in  any  (ri,d)-code  cannot  exceed 

r  /„\  /  _  \  i -i 


r,  (n 

\ 

71  \ 

H>. 

)+- 

+( 

,(W. 

if  d  is  odd,  and  the  number  of  words  cannot  exceed 


HD- 

/ 

'  Ti  Y 

■  + 

(d-2  )  j 

A  2  )/. 

if  d  is  even. 

It  is  also  true  that  any  saturated  packing  which  constitutes  an  (n.  d)-binary  code 
must  contain  at  least 


1 + c 

A 

(  n  X 

+  • 

•+  1  , 

V 

1/ 

\d-  l)\ 

words.  This  follows  from  the  fact  that  the  addition  of  each  new  word  to  the  packed 
collection  can  decrease  the  number  of  available  sites  by  at  most  £j=o  (”)■ 

The  above  bounds  can  be  used  to  establish  the  facts  that 

V«  >  L(d.n)=  [l  +  f7')  +  •••+  (  ,n  ')]  , 


1  +  (")  +  •  •  •  +  (i<t- 1 (d  odd) 

(“)  -i 

1  +  (")  + - h  ((i^2))  (d  even). 

Table  B  contains  the  ratios  fjn/L(d,n),  and  Table  C  contains  the  ratios  fj^/L’id^n). 
From  an  inspection  of  these  tables  it  can  be  seen  that  if  d  is  held  fixed  and  n  is  made 
larger,  then  the  ratios  of  form  r^.n/Tfd,  n)  tend  to  increase  with  n,  while  the  ratios  of  the 
form  Td,nlU(d,n)  are  decreasing  as  n  increases. 

Now  if  the  proposed  asymptotic  form  for  given  by 


R(d,  n)^e~^ 


is  correct ,  then  as  n  becomes  large 


R(d,n)/L(d,n) 


^  *.  ^  V  V  « 
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Table  C 

The  entries  in  this  table  are  the  ratios  fdtn/U(d,n) 
d = 3  d  =  4  d= 5  d=6  d=7  d= 8  d  =  9 


d  =  2 

d  =  3 

II 

■^3 

0.437 

- 

- 

0.388 

- 

- 

0.346 

0.727 

- 

0.315 

0.680 

0.368 

0.290 

0.621 

0.321 

0.269 

0.579 

0.290 

0.252 

0.554 

0.256 

0.237 

0.532 

0.227 

0.223 

0.512 

0.204 

0.212 

0.494 

0.187 

0.202 

0.477 

0.169 

0.193 

0.461 

0.156 

0.184 

0.448 

0.145 

0.177 

0.435 

0.133 

0.170 

0.422 

0.123 

- 

— 

— 

d  =  10 


— 

- 

- 

- 

0.551 

- 

_ 

0.437 

0.295 

- 

- 

0.415 

0.242 

- 

- 

0.382 

0.210 

0.428 

- 

0.350 

0.190 

0.291 

0.242 

0.323 

0.169 

0.2S0 

0.183 

0.301 

0.147 

0.260 

0.145 

0.279 

0.131 

0.243 

0.136 

0.260 

0.116 

0.216 

0.124 

0.244 

0.102 

0.189 

0.110 

0.227 

0.097 

0.172 

0.097 

0.329 

0.237 

0.204 

0.172 

0.159 

0.152 


0.190 

0.151 

0.103 

0.102 

0.082 

0.082 
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should  tend  to  something  greater  than  or  equal  to  1.  Similarly, 

R(d,n)/U(d,  n) 

should  approach  something  less  than  or  equal  to  1  as  n  becomes  large.  It  will  now  be 
shown  that  the  first  ratio  tends  to  infinity,  and  that  the  second  ratio  has  a  limit  of  zero. 


Note  that  L(d.n)  is  equal  to  6dln,  and  so 


,.  R(d.n) 

am  773 — r  =  lim  — — } — 
"  L{d,n)  n  Qdln 

=  l‘m  ed.n 


Since 


ed.n  =  o(nd~1). 


and  since 


1  _  Q'std’i  >  o 


if  3  is  about  0.40,  it  follows  that 


lim 

71—*  CC 


R(d,n) 

L(d,n) 


=  oc. 


If  d  is  even  then 

U(d,n)  =  e~S  , 

j,n 

and  it  follows  that 

U(d,n)  y  ’ 

where 

x(d)  =  i-i-(d-i)0*fo. 

Now  putting  |  in  for  0  and  replacing  B(d )  by  yields 


Vi. 
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which  is  less  than  zero  for  all  even  values  of  d  greater  than  or  equal  to  two.  Hence  it  now 


follows  that 


R{d,n)  n 
lim  777-j — r  =  0 
n—00  U(d,  n) 


for  even  values  of  d.  It  may  similarly  be  shown  that  the  above  result  is  also  true  for  the 
case  where  d  is  odd. 

The  above  results  indicate  that  the  proposed  asymptotic  formula,  R(d.n).  is  consis¬ 
tent  with  the  Hamming  bounds.  Another  well  known  bound  on  the  size  of  a  code,  the 
Plotkin  bound,  cannot  be  treated  in  a  similar  fashion.  This  is  because  the  Plotkin  bound 
only  applies  to  cases  for  which  n  <  2d.  Hence  it  is  not  possible  to  discuss  its  behavior 
when  d  is  fixed  and  n  is  allowed  to  become  large.  It  is  also  not  very  meaningful  to  compare 
the  observed  sizes  of  the  random  codes  with  the  values  of  the  maximum  code  size  A(n.d). 
This  is  due  to  the  fact  that  the  values  of  A(n,d)  are  not  known  in  all  cases,  and  also 
because  the  values  which  are  known  tend  to  exhibit  somewhat  haphazard  patterns. 

2.3.  Nonbinary  codes 

Nonbinarv  codes  may  also  be  sequentially  constructed  using  the  method  described 
in  the  previous  section.  If  m  is  a  prime  number,  and  q  is  any  power  of  m,  then  a  code 
with  symbols  from  the  Galois  field  GF(q)  is  called  a  9-ary  code.  An  (n,d)-<j-ary  code  is  a 
subset  of  n-tuples  over  GF(q).  The  n-tuples,  or  codewords,  are  such  that  the  Hamming 
distance  between  any  pair  of  them  is  greater  than  or  equal  to  d.  As  before,  the  Hamming 
distance  between  two  words  u  and  v  is  just  the  Hamming  weight  of  u  —  v. 

Ternary,  or  3-ary,  (n,  decodes  were  stochastically  formed  by  computer  for  2  <  d  <  7. 
As  was  the  case  for  the  binary  codes,  it  is  found  that  the  observed  fd,n  can  be  approximated 
by  expressions  of  the  form 

M*d.n)  =  C  +  ^n'- 

Note  that  for  o-arv  codeword  packing  the  ratio  p/v  is  given  by 


ft 
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The  results  of  the  random  packings  are  summarized  in  the  two  tables  below.  The  notation 
is  the  same  as  before,  and  additional  details  concerning  the  simulation  results  may  be 
found  in  Appendix  A. 

Parameters  determined  for  ternary  cases 


d 

L(d) 

U(d) 

ad 

0d 

2 

5 

9 

-4.53  x  10-2 

0.632 

3 

7 

9 

-1.52 

0.742 

4 

6 

10 

-10.4 

0.764 

5 

8 

10 

-33.5 

0.803 

6 

9 

11 

-2.32  X  102 

0.819 

7 

10 

11 

-1.20  x  103 

0.836 

Standardized  residuals 


n 

d  =  2 

d  =  3 

=  4 

d  —  5 

to 

II 

•X3 

d  =  7 

-  ■ 

5 

0.6 

6.0 

3.8 

- 

- 

- 

3 

6 

-0.6 

6.2 

-0.4 

- 

- 

- 

7 

-0.3 

-0.3 

0.4 

14.8 

- 

- 

8 

0.0 

1.4 

0.5 

-0.2 

-23.2 

_ 

9 

0.4 

-0.5 

-0.8 

0.3 

-0.1 

-43.9 

10 

- 

- 

2.4 

-1.4 

1.0 

0.0 

r.. 

11 

- 

- 

- 

- 

-0.4 

0.0 

It  is  interesting  to  note  that  the  values  and  $3  are  very  close  to  the  corresponding 
values  for  the  binary  case.  It  may  also  be  noted  that  if 

r  2  d  =  2 

then  the  values  (d  =  2,  •  •  • ,  7)  are  approximately  equal  to  one  another,  and  therefore 
they  are  all  near  some  unknown  constant  0. 


J2 

\d-^  d> 
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Estimates  of  3  from  the  ternary  cases 

d  3»(d) 

2  0.399 

3  0.408 

4  0.390 

5  0.416 

6  0.408 

7  0.407 

Similar  to  the  binary  case,  these  results  suggest  that 

T 

rd-n  ~  9d'n  as  n  — ♦  oc, 

for  some  3  approximately  equal  to  0.40  or  0.41. 

It  is  also  interesting  to  compare  the  pattern  of  the  values  of  Var(A/(n.d))  in  the 
ternary  case  with  the  pattern  found  in  the  binary  case.  For  the  binary  case.  Itoh  and 
Solomon  [27]  observed  that  the  variance  of  packing  density  is  larger  when  d  is  even  and 
smaller  when  d  is  odd.  Also  it  may  be  seen  from  Appendix  A  that  the  ratio 

Var(A/(n.<f))/Var(A/(n,d+  1)) 

is  generally  greater  than  one  when  d  is  even,  and  generally  less  than  one  when  d  is  odd. 
This  pattern  is  not  evident  for  the  ternary  data,  as  it  may  be  seen  from  the  results  given 
in  Appendix  A  that  the  above  ratio  is  greater  than  one  for  all  d.  For  the  q  —  4  and 
q  =  5  cases,  the  values  of  M(n,d)  also  strictly  decrease  as  d  increases  (with  n  held  fixed): 
however,  it  should  be  mentioned  that  there  is  insufficient  data  in  these  cases. 

For  q  >  3,  packings  could  be  generated  for  only  small  values  of  n  due  to  limitations  on 
computer  time.  However,  for  each  combination  of  q  and  d  the  values  of  -  log  rd  rJ  \og0d 
suggest  that  perhaps 
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holds  for  q  =  4  and  q  =  5.  The  tables  below  give  the  observed  values  of  -  log  fj-n/  log^,n 
for  q  =  4  and  q  -  5.  Additional  simulation  results  may  be  found  in  Appendix  A. 


Estimates  of  the  3d  from  the  4-ary  case 


fl 

d  =  2 

q  =  4 

d  =  3 

II 

.fe. 

3 

0.679 

- 

- 

4 

0.664 

0.748 

- 

5 

0.657 

0.754 

0.812 

6 

0.654 

0.754 

0.798 

7 

0.654 

0.752 

0.789 

Estimates  of  the  3d  from  the 

q  =  5 

5- ary  cases 

n 

d-  2 

d  =  3 

d  =  4 

3 

0.695 

- 

- 

4 

0.679 

0.758 

- 

5 

0.671 

0.753 

0.803 

6 

0.667 

0.754 

0.800 

Packing  by  Lee 

distance 

The  Lee  distance  between  two  q-ary  codewords  x  =  (ij,  and  y  =  {y-\ .  •  •  • .  yT, ) 

is  given  by 

n 

M*,y)  =  „y,) 

i=i 

where 

(\a-b\  if  |a  —  6|  <  q/2 
iy(a,b)  =  ^ 

(  q  —  |a  —  h|  otherwise. 

Two  codewords  are  considered  to  collide  if  the  Lee  distance  between  them  is  less  than 
some  fixed  c  €  { 1 , 2,  •  •  •} . 


Note  that  Lee  distance  is  the  same  as  Hamming  distance  if  q  is  equal  to  2  or  3. 
But  for  larger  q,  Lee  distance  is  a  more  sensitive  metric  since  the  contribution  from  each 
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coordinate  can  be  something  other  than  just  0  or  1.  Lee  distance  is  sometimes  used  by 
coding  theorists  because  it  is  well  suited  to  phase-modulation.  (The  Hamming  metric  is 
well  suited  to  orthogonal  modulation  schemes.) 

Consider  first  the  case  where  n  is  equal  to  1.  Then  the  metric  space  employing  the 
Lee  metric  is  identical  to  the  one  described  in  Section  3  of  Chapter  1.  A  random  packing 
on  this  metric  space  may  be  viewed  as  placing  at  random  arcs  of  length  c  on  a  circle 
of  circumference  q ,  with  the  endpoints  of  the  arcs  being  situated  at  integral  coordinates. 
Nonoverlapping  arcs  are  packed  until  the  length  of  the  longest  segment  of  the  circle  not 
covered  by  an  arc  is  less  than  c.  Thus,  this  somewhat  degenerate  codeword  packing 
problem  is  just  a  variation  of  a  discrete,  one-dimensional  parking  problem. 

Let  M\,c,q  denote  the  number  of  arcs  in  the  saturated  packing,  and  let  the  content  r 
of  each  packed  point  be  which  is  just  the  proportion  of  the  circle  covered  by  each  arc. 
Then  the  random  packing  density  is  given  by 

c 

Pl.c.q  — 

which  is  just  the  average  proportion  of  the  circle  covered  by  a  saturated  packing  of  arcs. 
Note  that  p\,\,q  is  trivially  equal  to  1,  since  eventually  every  segment  on  the  circle  will  be 
covered  by  a  packed  arc. 

Page  [43]  and  Downton  [12]  have  studied  this  packing  sequence  for  the  special  case 
of  c  equal  to  2.  They  proved  that  as  q  tends  to  infinity,  p\,2,q  tends  to  the  finite  limit 

P\,2  =  1  ~  e-2=0.8647. 

Using  some  results  of  Mackenzie  [36],  it  can  be  proved  that  for  c  €  {1,2,  •  •  •} 

Pi.c.q  P\. c  as  9  — 1  oc. 

where  the  sequence  of  limits  {pi,c}£ij  remains  finite  as  c  tends  to  infinity.  In  general.  p\,c 
must  be  obtained  by  numerically  integrating  an  expression  which  arises  from  a  recurrence 
technique.  However,  it  is  trivial  that  pi,i  =  1.  and  for  c  =  2  the  required  integral  may  be 
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evaluated  in  closed  form  to  yield  p\_ 2  =  1  -  f  2  in  agreement  with  Page. 

Mackenzie's  work  also  suggests  that 

0.2162  0.0360 

(2.1)  P\  c  —  0.7476+  - + - 5 —  as  c  —  oc. 

c  ci 

The  first  two  constants  are  obtained  in  [36]  via  numerical  integration;  however,  the  co¬ 
efficient  of  c-2  has  been  estimated  empirically  by  Mackenzie.  For  c  >  2.  (2.1)  gives  p in¬ 
correct  to  four  significant  figures. 

It  should  be  noted  that  0.7-476  is  the  asymptotic  packing  density  for  arcs  placed 
uniformly  on  the  circle.  This  value,  denoted  by  Pi.x-  was  first  obtained  by  Renyi  [47]  and. 
to  be  more  precise,  it  may  be  defined  by 

Pi.x  =  lim  px(a). 
a[Q 

where  p-ja)  is  just  the  average  proportion  of  the  unit  circle  which  is  covered  by  a  satu¬ 
rated  packing  of  arcs  of  length  a  whose  centers  are  chosen  according  to  a  uniform  (0. 1] 
distribution.  Blaisdell  and  Solomon  [4;  found  that 

pl  3C=0. 74759  79202  53398 

by  developing  explicit  bounds  which  can  determine  fifteen  significant  digits  correctly. 

Now  consider  cases  for  which  n  is  equal  to  2,  and  where  two  points  collide  when¬ 
ever  the  Lee  distance  between  them  is  less  than  d  6  {2, 3,  •••}.  This  is  a  discrete  two- 
dimensional  analog  to  the  one-dimensional  packing  problem  considered  previously  in  this 
section.  In  general,  parking  problems  in  two-dimensions  are  notoriously  difficult  and  resist 
all  attempts  at  mathematical  solutions. 

Some  simulation  results  are  summarized  in  Appendix  B.  Letting  denote  the 

number  of  points  in  a  saturated  packing,  these  results  indicate  that  the  center  densities 

r2,d,q  =  q~2E[M74,q) 

are  approximately  constant  for  each  fixed  d  and  various  choices  of  q  >  10 d.  Hence  it  seems 


v;. 
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likely  that 

r2.d.q  —  r2.d  ZS  9  —  3C  (d  >  2) 

for  some  undetermined  sequence  of  values  r2,2,  r2  3.  r2  <.  •  ■  For  example,  consider  the 
ejght  entries  having  n  =  2  and  d  =  3  These  results  certainly  suggest  that  the  r2  3  ?  ma\ 
tend  to  a  limit  around  0.1398  as  q  becomes  large. 

Let  f2  d  q  denote  the  observed  average  of  q~2SI2.d.q  for  all  simulation  trials  performed 
with  parameters  d  and  q,  and  let  r2j  be  the  average  of  all  of  the  f2.d.q  for  which  q  >  10d. 
The  values  r.d  ( d  >  2)  shown  below  will  serve  as  estimates  of  the  limiting  value*  r2  d 
(d  >  2). 

Estimated  limits  for  the  center  densities 


d 

i~2.d 

2 

0.3642 

3 

0.1398 

4 

0.08025 

5 

0.04903 

6 

0.03415 

1 

0.02453 

8 

0.01882 

9 

0.01467 

10 

0.01186 

11 

0.009722 

12 

0.008176 

Although  it  is  difficult  to  approximate  the  above  limiting  densities  analytically,  sev¬ 
eral  relationships  among  the  center  densities  can  be  observed.  In  what  follows,  a  scheme  is 
developed  for  producing  planar  densities  from  the  center  densities.  Then  it  will  be  shown 
how  relationships  among  the  limiting  center  densities  and  among  the  limiting  planar  den¬ 
sities  can  be  approximated  by  simple  functions  of  the  parameter  d.  Finally,  it  will  be 
shown  that  both  approximation  schemes  can  be  used  to  produce  estimates  of  an  overall 
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in:..:;:!!:  .  .  .  a :.d  that  the  two  methods  produce  estimates  winch  agree  closely  with 


-a 


Let  0- 
case  Then 


q:j>:  i  q.  where  P2.d.q  is  the  probability  that  two  points  collide  in  the  (d.q) 


d~2 

@2.d  =  1  +  4  Jjd-  1  ~  J) 

j  =  0 


=  2d(d-  1)  +  1 


Niva  ?u|-j>o>»  that  each  packed  point  serve-  a*-  the  position  j*« •:!.*  id  a  "d.amotid " 

shaped  configuration  of  t-2 j  points  which  are  fixed  relative  to  the  position  point.  The 
contents  t'2.3-  V2A- '  •  •  are  taken  to  be  as  large  as  possible,  subject  to  a  constraint 
requiring  that  the  diamonds  surrounding  two  disjoint  points  should  contain  no  common 
points.  For  d  odd.  V2,d  =  (d2  i-  l)/2.  The  diamond  consists  of  a  row  of  d  points  centered 
on  the  position  point,  and  sandwiched  around  this  row  are  pairs  of  successive!;*  smaller 
rows  of  d  -  2.  d  -  4.  •  •  • .  3.  and  1  points,  each  centered  on  the  position  point .  For  d  even. 
t’2 ,d  =  d? 1 2:  however,  it  is  now  not  possible  to  have  the  position  point  being  at  the  exact 
center  of  the  configuration.  The  diamond  consists  of  a  row  of  d  points,  sandwiched  between 
pairs  of  rows  having  d  -  2.d  -  4.  -  ■  ■  .4  and  2  pv The  position  point  is  one  of  the  two 
centermost  points. 


Even  though  the  diamond  shaped  c*.  .figurations  described  here  will  not  overlap  if 
they  are  positioned  on  disjoint  points,  the  packing  problem  considered  here  is  not  always 
the  same  as  the  problem  of  packing  such  oriented  diamonds  on  a  q  by  q  torus  shaped  lattice. 
The  problems  are  different  if  d  is  even.  This  fact  can  be  established  easily  for  the  d  ~  2 
case,  since  it  is  clear  that  the  two  “diamonds”  {(r,  s),(r+l,s)}  and  {(r, s+l),(r+l,s+l)} 
do  not  overlap  even  though  the  Lee  distance  between  the  position  points  (r,  s)  and  (r,  s  +  1 
is  less  than  two.  A  similar  argument  can  be  used  to  handle  any  other  even  valued  d. 


Define  limiting  densities  {P2.d}!f=2  by 

Pz.d  =  vz.dTz.d  ( d  >  2). 


RANOOM  PACK  I  NO  AMO  RANDOM  COVERING  SEOUENCES(U) 
STANFORD  UNIV  CA  DEPT  OF  STATISTICS  C  D  SUTTON 
24  INK  N  TR-4R2  MM14-M-K-D13C 
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and  let 

P2.d  = 'l.d^.d  (d  =  2,  •  •  •,12) 

estimate  p^.d-  The  values  of  these  estimates  are  given  below.  Note  that  for  each  even 
value  of  d,  pi.i  is  much  closer  to  Pi,d+\  than  to  fa.d-i- 

Estimates  of  limiting  planar  densities 

£ _ h.d 

2  0.72848 

3  0.69876 

4  0.64197 

5  0.63743 

6  0.61464 

7  0.61331 

8  0.60233 

9  0.60127 

10  0.59317 

1 1  0.59305 

12  0.58866 

Unlike  the  n  =  1  case,  the  pi,d  are  not  given  by  an  expression  similar  to  (2.1); 
however,  several  relationships  among  the  pi.d  and  the  r2,rf  are  apparent.  Letting 

^d  =  Pi,d-\l P2.d+\  (d  =  4,6.8,  •••) 


and 


it  may  be  seen  below  that 


d>d  =  T3'd/Tj.d+ 1  (d>  2), 

I 


A  d  —  1  + 


(d+l)2 

d* 


approximates  and  that 


$a  = 


/0a  (d  even) 

1  +  3  +  jr  ~  yr  (d  odd) 


.*  i»* 


i  *»i  u * 
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It  follows  from  (2.2)  that 


8  [* +  )  ’  a  =  2’3' 


Using 


n  f. +«k+"- 

11  r>L-*\ 


1+E 


4  k*  +  4k  +  1 


i  1  V'  1 

>,  +  i£> 

=  1  +  Kt-SK2 

\  kss  1 

>  1.010226 


n  [1  +  !2i±i*!] 

AM  ^  j 


J  V'  i  f,  ,  (2A-  +  1  )2 
■ exp  1  2-  108  1  +  -75Tir- 


-{1.1^11 

[;  (x- £  *")*B /."(?*?)'■ 


<  1.010476 

it  is  possible  to  obtain  upper  and  lower  bounds  from  (2.3)  which  are  not  too  burdensome 
to  compute.  Given  below  are  upper  and  lower  bounds  of  p2i0O  based  on  the  estimates  f>2,d- 
for  d  equal  to  3,  5,  7.  9,  and  11. 


LI’hllLlWt'l 
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Bounds  of  p2,oc  obtained  from  (2.3) 


(d  =  2 j  -  1) 


lower  bound 
0.5656 


0.5664 


0.5655 

0.5654 

0.5644 


upper  bound 
0.5658 

0.5665 

0.5657 

0.5655 

0.5646 


Alternatively. 


P2.d/p2.oo  =  f - ^—<t>k) 

r_.  \V2,k+i  ) 


suggests  that  pi/x  may  be  approximated  by 


‘260000  ,  v 

n  (— ♦*) 

tJi  V  J 


Note  that  the  product  is  truncated  so  that  it  can  be  evaluated  by  computer.  The  estimates 
of  P2.-x.  obtained  from  (2.4)  and  the  r2j  (d  =  2,  •  •  • ,  12)  are  given  below. 


$ 

$ 
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Approximations  of  P2,x  obtained  from  (2.4) 


d 

(2.4) 

2 

0.5671 

3 

0.5657 

4 

0.5654 

5 

0.5666 

6 

0.5650 

7 

0.5657 

8 

0.5656 

9 

0.5655 

10 

0.5641 

11 

0.5645 

12 

0.5646 

Note  that  for  each  value  of  d ,  the  estimate  obtained  from  (2.4)  is  consonant  with  the 
upper  and  lower  bounds  provided  by  the  previous  method.  Assuming  that  the  variation 
in  the  estimates  is  due  to  chance,  it  seems  reasonable  to  suppose  that  p2 -oc  is  some  value 
close  to  0.565  or  0.566.  It  should  be  noted  that  these  estimates  were  obtained  by  assuming 
that  the  densities  really  do  follow  the  pattern  observed  above.  It  is  not  feasible  to  check 
out  the  accuracy  of  the  approximations  by  performing  Monte  Carlo  experiments  using 
extremely  large  values  for  d. 

Now  instead  of  keeping  n  fixed  at  2,  consider  randomly  packed  4-ary  (n,2)-codes  for 
n  =  3,4,  -  -.  Similar  to  the  corresponding  Hamming  distance  cases,  it  appears  that  the 
ratios  -  log  r2  „/ log02i„  (n  =  3, •■•,7)  are  approximately  constant,  suggesting  that  the 
densities  might  be  asymptotically  equivalent  to 

@2r?  as  n  — .  oo 


for  some  /32.  where  02,n  =  qnp  -  2n  +  1. 


i  rLi‘  «uj  *.|  *|<  ly  4.1  M  w  >,*  *|*  ^  w  »|i  wf 


Af 


s 
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Estimates  of  /32 

n  ~  logr2n/log02,n 

3  0.596 

4  0.597 

5  0.602 


Likewise,  the  corresponding  ratios  for  the  d  -  3  cases  tend  to  approach  a  constant  3 3  as 
n  becomes  large.  Here  #3,n  =  qnp  =  2 n2  +  n  +  1. 

Estimates  of  3% 

n  -  log  r3.n/log  t?3.n 

4  0.760 

5  0.753 

6  0.749 


2.5.  Other  metrics 

Square  Box  Metric 

Consider  packing  on  the  space  of  g-ary  codewords  of  length  2  using  the  metric  given 
by 

rfx,y)  =  max{t/(z1,yi),t/(x2,y2)}, 

where  v  is  a  metric  defined  in  the  previous  section.  If  two  points  collide  whenever  the 
distance  between  them  is  less  than  c,  then  the  packing  sequence  corresponds  to  packing  c 
by  c  blocks  of  points  on  a  q  by  q  torus  shaped  lattice  of  points  so  that  the  packed  blocks 
are  pairwise  disjoint. 

For  various  combinations  of  c  and  q.  the  packing  sequence  described  above  was  re¬ 
peatedly  simulated  by  computer.  The  results  of  the  computer  trials  are  summarized  in 
Appendix  C.  For  example,  the  first  five  cases  given  in  Appendix  C  show  the  results  of 
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packing  2  by  2  blocks  of  points  on  torus  shaped  lattices  of  five  different  sizes.  The  sizes 
range  from  23  by  23  to  35  by  35.  In  each  of  the  five  cases,  it  can  be  seen  that  the  average 
planar  density  observed  does  not  differ  much  from  the  overall  average  of  0.747  obtained 
from  combining  the  results  of  all  of  the  cases. 

Letting  Mc<q  denote  the  number  of  points  in  a  saturated  packing,  the  random  packing 
density  may  be  defined  by 

Pc,q  =  ( c2/q2)E[MC'„ ]. 

Based  on  the  simulation  output  and  the  knowledge  that  an  analogous  metric  in  a  similar 
one-dimensional  setting  produces  limiting  densities,  it  seems  reasonable  to  expect  that 


Pc,q  —  Pc  as  q  ■—  oc 

for  some  sequence  {pc}% 2-  Letting  pc,q  denote  the  observed  average  of  (c2 Jq2).Mcq.  p:  is 
estimated  by  averaging  all  of  the  pc,q  for  which  q  >  10c. 

Similar  to  the  n  =  1  case  of  the  previous  section,  it  is  found  by  curve  fitting  that  the 
pc  can  be  close!}  approximated  by  an  expression  of  the  form 

(2.5)  ^o  +  ^ic"1  +02C'2. 


Performing  a  least  squares  fit  on  the  values  P2,P3,  -  •  ■  ,pg  yields  the  approximation 


0.3142  0.1092 

Pc  25  Pc  —  0.5626  4- - + - r — . 


It  may  be  seen  below  that  this  approximation  formula  does  rather  well.  Also  shown  below 
are  the  least  square  estimates  based  on  the  model 


Pc  —  Poo“t 


which  are  given  by 

Pc  =  0.5622(1.766)1/,c. 

It  should  be  noted  that  while  two  models  yield  fits  of  almost  identical  quality,  the  model 
given  by  (2.6)  has  one  less  parameter  than  the  model  given  by  (2.5). 


myrow 
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Results  for  square  box  metric 


c 

Pc 

Pc 

Pc 

s.d.  ( pc ) 

2 

0.7470 

0.7470 

0.7470 

0.0007 

3 

0.6794 

0.6795 

0.6795 

0.0007 

4 

0.6483 

0.6480 

0.6480 

0.0007 

5 

0.6301 

0.6298 

0.6299 

0.0005 

6 

0.6177 

0.6180 

0.6180 

0.0005 

1 

0.6092 

0.6097 

0.6097 

0.0006 

8 

0.6029 

0.6036 

0.6036 

0.0007 

9 

0.5999 

0.5989 

0.5988 

0.0007 

Note  that  both  models  yield  nearly  the  same  value  for  p2.oc  =  lira—  ^  pc.  The  first 
model  gives  p2.oc  =  0.5626,  while  the  second  model  gives  p2,x  =  0.5622.  p2.x  may  be 
interpreted  as  the  limiting  proportion  of  a  unit  torus  covered  by  a  saturated  packing  of 
squares  of  area  v  as  r  tends  to  zero,  when  the  centers  of  the  squares  in  the  packing  sequence 
arise  from  a  uniform  distribution  over  the  torus. 

The  values  of  the  quantity 


^2  —  y/p2~x  Pl.oc 

produced  by  the  two  estimates  of  p2,x  are  0.0025  and  0.0022.  These  agree  closely  with 
the  estimates  of  A2  obtained  by  other  authors.  Akeda  and  Hori  [2],  Blaisdell  and  Solomon 
[4],  and  Jodrey  and  Tory  [30]  estimate  A2  to  be  0.002“,  0.0025,  and  0.0021,  respectively. 
All  of  these  estimates  serve  to  refute  Palasti’s  conjecture  (see  [44])  that  p2,x  =  p\,x-  ^  *s 
interesting  to  note  that  although  the  quantity  p2,x  discussed  in  the  previous  section  may 
be  interpreted  in  the  same  way  as  p2,x  from  this  section,  its  value  was  estimated  to  be 
slightly  larger  by  the  methods  of  Section  2.4,  being  around  0.565  or  0.566. 

Simple  Cubic  Metric 

Now  consider  the  simple  cubic  packing  sequence  on  the  space  of  ternary  codewords 
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of  length  n.  This  packing  scheme  is  described  in  Section  1.8,  and  has  been  discussed 
previously  by  Itoh  and  Udea  [29]  and  Itoh  and  Solomon  [27]. 

Letting  M„  denote  the  number  of  n-tuples  in  a  saturated  packing  the  random  packing 
density  is  given  by 

Pn  =  2  ~nE[Mn). 

This  follows  from  the  fact  that  the  content  v  associated  with  each  packed  point  is  equal 
to  2"/4"  =  2_n.  It  can  be  seen  below  that  an  expression  of  the  form 

onn~0 / log  n 

fits  the  observed  pTl  rather  well  for  n  not  too  small.  Performing  a  weighted  nonlinear 
regression  on  pc.  •  •  -,/3n  yields  the  approximation 

(1.031995)n 

Pn^Pn-  no.2,S23:log„- 

The  data  is  taken  from  [27],  Note  that  this  model  does  not  have  the  ratio  pn+1  / pn  tending 
to  1  as  suggested  in  [27], 

Comparison  for  the  simple  cubic  packing  scheme 


n 

Pn 

Pn 

s.d.  (pn) 

5 

0.3144 

0.4927 

0.0010 

6 

0.4585 

0.4508 

0.0008 

7 

0.4214 

0.4212 

0.0005 

8 

0.3955 

0.3958 

0.0007 

9 

0.3766 

0.3762 

0.0012 

10 

0.3625 

0.3631 

0.0018 

11 

0.3519 

0.3516 

0.0013 

2.6.  Complementary  codes 

In  Section  2,  random  binary  (n,t/)-codes  were  formed  by  adding  randomly  chosen 
words  to  a  packed  code  if  the  selected  word  was  at  a  Hamming  distance  of  d  or  more  from 
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each  word  already  in  the  packed  set.  This  section  will  also  consider  stochastically  formed 
binary  (n.d)-c odes:  however,  now  the  criteria  for  packing  new  words  will  be  different.  Be¬ 
fore  describing  the  packing  scheme  and  presenting  the  results,  a  few  additional  comments 
concerning  error-correcting  codes  will  be  given. 

For  fixed  n  and  d  it  is  clear  that  information  can  be  transmitted  at  the  greatest 
rate  whenever  the  number  of  words  in  the  code  is  as  large  as  possible.  A(n.d)  denotes 
the  number  of  codewords  in  the  largest  possible  binary  code  of  length  n  and  minimum 
distance  d.  A(n.d)  is  known  precisely  in  some  cases,  while  in  other  cases  only  upper  and 
lower  bounds  are  known.  MacWilliams  and  Sloane  [38]  give  relations  involving  A(n.d). 
discuss  the  bounds  on  A(n,d).  and  give  values  where  known.  Their  book  also  serves  as 
an  excellent  reference  on  the  coding  problem  in  general,  as  do  the  books  by  Peterson  and 
Weldon  [-45]  and  Lin  and  Costello  [35] . 

Several  of  the  most  familiar  error-correcting  codes  exhibit  quite  a  lot  of  structure 
and  contain  the  maximum  number  of  Codewords,  A(n.d).  Perhaps  the  best  known  of 
such  codes  is  the  Golay  codes.  This  code  has  length  24  and  minimum  distance  8.  It  has 
.4(24,8)  =  4096  codewords  and  the  Hamming  distance  between  any  pair  of  codewords 
is  either  8,  12.  16.  or  24.  The  Golay  code  belongs  to  the  class  of  linear  codes  and  has 
numerous  interesting  structural  properties,  see  Thompson  [58].  Among  these  properties  is 
the  fact  that  if  u  is  a  codeword  then  so  is  u*,  where  u*  is  the  complement  of  u.  i.e.  u*  has 
a  one  in  each  position  where  u  has  a  zero  and  vice  versa.  Codes  possessing  this  property 
will  be  called  complementary  codes. 

For  various  combinations  of  n  and  d,  complementary  codes  were  stochastically  gener¬ 
ated  using  a  computer.  Each  of  the  resulting  codes  possesses  the  additional  property  that 
the  Hamming  distances  between  all  pairs  of  words  belong  to  a  restricted  set  of  values.  A 
code  word  of  this  type  having  words  of  length  n  can  arise  from  the  constrained  random 
packing  of  binary  ( n  -  l)-tuples  using  a  procedure  which  will  be  outlined  below. 

Itoh  (see  [26].  [28] )  shows  that  it  is  not  too  difficult  to  stochastically  generate  a  (24. 81- 
code  of  size  .4(24,8),  and  in  about  189?  of  the  possible  complementary  coding  schemes 


£ 
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with  4  <  n  <  12  at  least  one  random  packing  resulted  in  a  code  of  size  A(n.d)  being 
formed.  Naturally,  the  unrestricted  packing  scheme  of  Section  2  may  also  produce  codes 
of  size  A(n,d)\  however,  this  was  observed  in  only  a  few  of  the  low  dimensional  cases.  In 
some  cases  (for  example,  (n.d)  =  (24.8))  it  has  been  shown  that  any  random  code  of  size 
A(n.d)  must  be  equivalent  to  an  algebraic  code  of  size  A(n.d)  possessing  nice  properties. 
However,  it  has  not  been  established  that  all  random  codes  of  size  A{n,d)  are  equivalent 
to  algebraic  codes. 

Thus  it  may  be  possible  to  find  large  codes  by  restricted  random  packing,  and  given 
the  close  connection  between  error-correcting  codes  and  the  packing  of  n-dimensional 
spheres  (see  [34]).  denser  sphere  packings  are  also  possible.  It  is  interesting  to  note  that 
extremely  dense  packings  have  also  been  generated  stochastically  via  simulated  annealing 

(see  [13]). 

Now  the  packing  procedure  will  be  described.  Letting  AC  denote  the  set  of  allowable 
interword  distances,  require  thatr'Al  be  of  the  form 

{d.  di,  ■  ■  ■  ,dm.  n  -  dm,  ■  ■  ■ .  n  -  dy,  n  -  d.  n} 

where  d  <  d\  <  ■  ■  •  <  dm  <  n/2.  Sequentially  choosing  (n  —  l)-tuples  at  random  from 
the  set  of  all  binary  ( n  -  1  (-tuples,  a  new  selection  is  added  to  the  packed  collection  if 
the  Hamming  distances  between  it  and  all  previously  packed  selections  are  elements  of  AC. 
Continuing  until  it  is  no  longer  possible  to  add  to  the  packed  set.  codewords  of  length  n 
are  formed  from  the  ( n  -  1  (-tuples  by  adding  a  zero  as  the  nth  component.  These  words, 
plus  their  complements,  constitute  an  (n,  (/(-complementary  code  having  the  property  that 
the  distance  between  any  pair  of  words  belongs  to  AC. 

Every  such  complementary  code  having  4  <  n  <  12  and  2  <  d  <  n/2  was  repeatedly 
generated.  In  each  case  the  average  observed  packing  density  was  calculated.  For  n  and 
d  both  even,  the  densest  codes  arose  from  AC  containing  n  and  all  even  integers  between  d 
and  n  —  d  inclusive.  In  these  cases  the  complementary  codes  formed  are  denser  than  the 
corresponding  (n.d)- codes  constructed  by  the  method  of  Section  2.  For  all  other  choices 
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of  n  and  d.  the  densest  complementary  codes  were  generated  by  having  AJ  =  {d.d  -e 
1.  •  •• ,  n  ~  d  -  1.  n  -  d,  n}.  The  simulation  results  are  summarized  in  Appendix  D.  Since 
the  random  complementary  coding  scheme  generally  produces  large  codes  more  frequently 
than  the  unrestricted  random  coding  scheme,  and  since  for  the  same  choice  of  (n.d)  the 
complementary  codes  can  be  produced  more  quickly  by  computer  (remember.  ( n  - 1  (-tuples 
are  being  packed  in  the  complementary  case),  it  seems  that  random  complementary  coding 
may  be  superior  to  unrestricted  random  coding  in  the  search  for  m  w  large  codes. 

Simulations  were  avoided  for  the  cases  covered  by  the  two  results  below.  These  facts 
follow  from  the  following  lemma  whose  simple  proof  is  omitted. 

Lemma  2.1.  The  Hamming  distance  between  a  codeword  of  even  Hamming  weight 
and  a  codeword  of  odd  Hamming  weight  is  odd.  Otherwise  the  Hamming  distam  .  between 
two  codewords  is  even. 

Fact  2.1.  Stochastically  constructed  ( n.  r/ (-complement  ary  codes  with  AJ  =  {d.du 
■  ■  ■ ,  dm.  n  -  dm.  ■  ■  ■ .  n  -  d\ ,  ti  -  d.  n},  where  d.  dx.  ■  ■  ■ ,  dm.  n  -  dm.  ■  ■  • .  n  -  d\  and  n  -  d  are 
all  odd,  will  contain  exactly  four  codewords. 

Fact  2.2.  If  n  is  even,  then  a  stochastically  constructed  (n, 2)-complementary  code 
with  K.  containing  all  even  integers  between  2  and  n  —  2  inclusive  will  contain  exactly 
A(n.d)  =  2n_1  codewords. 

For  the  class  of  (  n,  2)-codes  having  AJ  =  {2,  3,  •  ■  ■ ,  n  —  3,  n  —  2,  n},  a  model  similar  to 
those  of  Sections  2  and  3  seems  to  fit  the  observed  densities  for  n  not  too  small.  That  is. 
letting  $2.n  equal 

2n_1  /’{two  arbitrary  ( n  -  1  (-tuples  collide}  =  n  +  1. 
it  is  found  that  the  densities  are  reasonably  approximated  by 
(2.7)  ^"n  6147  -  0.13696>2.^(0  614T> 

for  n  >  9. 

Results  for  cases  with  K  —  { tj }  =  {2,  3.  -  •  ■ .  n  -  2} 
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n 

f 

(2.7) 

7 

0.290 

0.289 

8 

0.272 

0.268 

9 

0.251 

0.251 

10 

0.235 

0.236 

11 

0.225 

0.224 

12 

0.212 

0.212 

13 

0.204 

0.203 

14 

0.193 

0.194 

Similarly,  letting  AC  =  {3. 4.  •  •  ■ ,  n  -  4,  n  -  3.  n}  produces  densities  that  are  approximated 
by 


(2.8) 


/j-0”S3 

"3.n 


+ 


4.3400 


-2(0.7753) 

3,n 


for  n  >  11,  where 


n2  +  n  +  2 
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Results  for  cases  with  K  -  { n }  =  {3.  -1.  •  • .  n  -  3} 


r? 

r 

(2.8) 

10 

0.0526 

0.0462 

11 

0.0448 

0.0450 

12 

0.03S7 

0.0385 

13 

0.0339 

0.0335 

14 

0.0300 

0.0305 

2.7.  Summary 

For  randomly  packed  g-ary  (n.  decodes,  evidence  has  been  presented  which  suggests 
that  as  the  dimension  rj  tends  to  infinity,  the  center  densities  are  asymptotically  equivalent 
to  expressions  of  the  form 

(p/v)~3d. 

Here  p  is  the  probability  that  two  arbitrary  codewords  collide,  and  r  =  q~n  is  the  content 
associated  with  each  point  in  the  space.  This  was  demonstrated  most  convincingly  for 
the  case  of  binary  codewords  packed  by  Hamming  distance,  where  additional  relations 
involving  the  3i  were  also  found.  Support  was  also  given  for  some  q-ary  cases,  and  for  a 
case  where  Lee  distance  was  used  as  the  metric  instead  of  Hamming  distance. 

Relations  among  the  packing  densities  were  also  found  in  cases  where  n  was  set  equal 
to  2  and  q  was  taken  to  be  large.  One  such  two-dimensional  scheme  yielded  an  estimate 
of  the  planar  packing  density.  which  agrees  closely  with  estimates  obtained  by  other 
investigators.  The  estimate  is  the  limiting  value  of  a  formula  which  was  shown  to  closely 
approximate  the  packing  densities  arising  from  a  class  of  discrete  two-dimensional  packing 
schemes.  This  method  of  finding  P2,oc  differs  from  those  used  previously.  It  is  interesting 
to  note  that  the  formula  is  of  the  same  form  as  Mackenzie's  formula  for  the  class  of  one¬ 
dimensional  analogs.  Hopefully,  similar  simulations  may  be  done  for  the  three-dimensional 
case,  and  the  limiting  density  obtained  may  be  compared  with  the  results  reported  in 
Blaisdell  and  Solomon  [5], 
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Packing  Times  and  Covering  Times 


3.1.  Introduction 

In  Chapter  2  the  time  to  saturation,  T,  was  defined  for  a  packing  sequence  on  a 
metric  space  (5,/r)  with  collision  criterion  <5.  For  a  given  packing  sequence,  T  is  just  the 
total  number  of  random  independent  selections  from  5.  including  rejections,  required  to 
achieve  a  state  of  saturation.  For  convenience,  T  will  sometimes  be  called  the  packing 
time. 

It  is  also  possible  to  define  another  random  variable  associted  with  the  process  of 
sequentially  selecting  points  from  the  space  5  of  a  metric  space  (5,/z).  For  each  element 
s  €  5.  define  a  coverage  set  .4(s)  such  that  if  X  is  uniformly  distributed  over  5  then 
P{ X  €  A(s)}  =  e,  for  some  c  >  0.  A(s)  can  be  a  neighborhood  of  s,  but  this  is  not 
necessary.  Suppose  that  C\,C2 ,  •••  are  chosen  uniformly  and  independently  from  5  and 
let  H\  called  the  covering  time,  be  defined  by 


IV  =  min 


|*6  {1,2,..-}  :5C  |J  A(C,)j. 


It  is  said  that  S  is  covered  by  [J^=1  A(Cj)  for  any  j  >  VV.  Equivalently,  it  is  also  said  that 
the  sequence  of  points  Ci,C2,---,Cj  ( j  >  W)  provide  a  covering  of  the  space.  Random 
coverage  problems  are  abundant  in  the  literature,  see  for  example  [7],  [16],  [17],  [18],  [19], 
[20],  [24],  [25],  [40],  [42],  [49],  [50],  [51],  [54],  [57],  and  [59], 

Below,  packing  and  covering  sequences  are  examined  on  several  spaces.  In  each  space, 
coverage  sets  are  defined  so  that  for  some  choice  of  n  and  6  the  packing  sequence  corre¬ 
sponds  to  randomly  packing  the  coverage  sets  so  that  members  of  the  packed  collection 
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are  pairwise  disjoint.  The  distributions  of  the  random  variables  T  and  IT  are  compared 
in  each  case. 

3.2.  The  Continuous  Circle 

Consider  the  sequential  random  packing  of  arcs  of  length  a  <  1  on  a  circle  of  unit 
circumference.  Random  points  Ct,C2,  •  •  •  sequentially  chosen  from  S  =  (0, 1]  serve  as  the 
midpoints  of  arcs  of  length  a.  The  collision  criterion,  6 ,  is  set  equal  to  a,  and  n  is  taken 
to  be  the  metric  described  in  Section  1.4.  For  this  setting.  Renyi  [47]  has  shown  that 
the  expected  proportion  of  the  circle  covered  by  packed  arcs  at  time  T  approaches  the 


constant 


as  the  length  of  the  arcs  tends  to  zero. 

It  will  now  be  shown  that  E[T\  =  oc  whenever  a  <  To  do  this  it  is  convenient  to 
first  define  a  few  terms.  When  a  second  arc  is  fairly  packed  on  the  circle,  two  gaps  having 
lengths  summing  to  1  -  2a  are  formed.  Let  G  denote  the  shorter  of  the  two  gaps,  and  let 
L  denote  its  length.  Note  that  L  is  uniformly  distributed  over  (0,  |  -  a). 

Now  define  a  random  variable  U  as  follows.  If  L  <  a,  let  U  =  0.  If  L  >  a,  let  U  =  k 
if  Cfc  is  the  midpoint  of  the  first  arc  which  is  packed  into  G. 

Note  that  the  first  random  arc  will  always  be  packed  onto  the  circle,  and  that  the 
expected  number  of  additional  selections  needed  to  pack  a  second  arc  onto  the  circle  is 
Y~2a.  ■  Then  given  that  L  =  x  >  a,  the  probability  that  a  random  arc  is  packed  into  G  is 
just  x  -  a.  Hence,  for  x  >  a  it  follows  that 
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Clearly  T  >  U.  so  that  for  any  a  <  \  and  any  s  between  0  and  5  -  2a  it  follows  that 


E[T }  >  E[U] 


> 


dx 


Since  this  inequality  is  true  for  all  e  €  (O.j  -  2a)  it  follows  that  E[T]  =  x.  Thus  the 
following  theorem  has  been  proved  for  the  case  of  a  <  The  proof  for  the  remaining 
values  of  a  is  similar. 


Theorem  3.1.  For  the  packing  sequence  of  arcs  of  length  a  on  a  circle  of  unit 
circumference,  if  a  <  g  then  E[T]  —  oc. 


Now  consider  the  sequential  covering  of  the  circle  with  random  arcs  of  length  a.  and 
let  M(a)  =  [a"1].  Stevens  [57]  determined  that 

M(a) 

(3.1)  P{W  <k}  = 

;=o 


.1(1  —  ja) 


k- 1 


and  Flatto  and  Konheim  [17]  used  (3.1)  to  show  that 


A  f(i) 

E\\v]  =  i-  53(-i)fc 


k=\ 


(1  -  ka)k~l 

( ka)k+ 1 


and 


(3.2) 


~  o'1  logo-1  as  a  j  0. 


Adopting  the  convention  that  the  ratio  of  any  finite  number  over  infinity  is  equal  to  zero, 
note  that  for  any  arclength  a  <  3, 


E[W] 

E[T) 


=  0. 


Now  consider  randomly  packing  and  covering  the  circle  with  arcs  of  variable  length. 
That  is,  suppose  arc  midpoints  C1.C2,  ■  •  •  are  selected  from  5  =  (0. 1]  as  before,  and  let 
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the  arc  centered  on  C,  have  random  length  Z,.  The  lengths  Zi,Z2-"'  are  taken  to  be 
i.i.d.  random  variables  having  c.d.f.  F. 

The  coverage  problem  can  be  described  as  before,  only  now  the  coverage  sets 

A,  =  {s  £  5  :  fj(s,  C,)  <  Z,/ 2}  (*  =  1,2,  •  •  •) 

have  random  size.  The  packing  problem  can  be  described  as  follows.  The  coverage  set  .4, 
is  added  to  the  collection  of  packed  sets  if  and  only  if  it  does  not  intersect  any  previously 
packed  member  of  the  collection.  The  packed  collection  is  considered  to  be  saturated 
whenever  the  probability  that  an  additional  member  can  be  added  is  zero. 

Siegel  and  Holst  [51]  derived  an  expression  for  the  probability  that  the  circle  is  com¬ 
pletely  covered  by  arcs  of  variable  length.  For  the  case  of 


F(x)  —  x  (0  <  x  <  1), 


their  result  may  be  written 
(3.3) 


with  the  sum  being  over  all  sets  of  non-negative  integers  mj,  •  •  • ,  m:  such  that  £ ?=i  = 

k  -  j.  For  this  case  of  random  arcs  having  mean  length  it  may  be  shown  that  P{W  <  k} 
is  greater  than  the  corresponding  probability  for  arcs  of  fixed  length  5.  This  is  accom¬ 
plished  by  noting  that  the  first  two  terms  of  (3.3)  equals  P{W  <  k }  for  the  arcs  of  equal 
length,  and  then  showing  that  the  sequence  62, 63,  •  •  •  constitutes  a  monotonically  decreas¬ 
ing  sequence.  It  follows  that  E\}V]  <  00  for  the  random  arc  lengths,  since  the  expectation 
is  finite  for  the  constant  length  arcs. 


where 


For  F(x)  =  x  it  is  clear  that  with  probability  1  the  packed  set  will  not  become 
saturated.  This  follows  from  the  fact  that  with  probability  1  each  newly  packed  arc  will 
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be  situated  such  that  there  are  open  gaps  of  positive  length  on  both  sides  of  it.  and  with 
positive  probability  an  additional  arc  of  sufficiently  small  size  can  be  packed  into  each  gap. 

Now  suppose  that  F  is  such  that 

P{Z,  >  a}  =  1 


for  some  a  >  0.  Then  it  is  clear  that  £[11']  <  oc  since  the  expectation  is  finite  in  the  case 
of  arcs  having  constant  length  a,  because  the  arcs  of  variable  length  will  cover  the  circle 
at  least  as  quickly  as  the  arcs  of  length  a  do. 


It  is  still  possible  to  have  E[T]  being  infinite  in  this  case.  For  instance,  suppose  F  is 
the  uniform  distribution  on  ( a.b ),  where  0  <  a  <  b  <  Then  with  positive  probability  the 
second  arc  selected  will  form  a  gap  G  of  length  L  6  (a,  6).  and  given  that  this  event  occurs. 
L  will  be  uniformly  distributed  over  (a, 6).  Given  that  L  =  x  £  (a.b).  the  probability  that 
an  arbitrary  arc  will  be  packed  into  G  is 


1 

(b-a) 


-  z)dz  — 


(g  ~  Q)2 
2  (b-a)' 


It  then  follows  that  the  expected  number  of  attempts  required  to  pack  the  gap  with  an 
arc,  given  that  a  <  L  <  b.  is 


—  /‘ 
(b  ~  a)  Ja 


2(6-  a) 
(x  -  a)2 


dx  =  oc. 


These  facts  are  sufficient  to  ensure  that  E[T]  =  oc. 


3.3.  Interarrival  times 


Let  the  random  variables  Ti,T2,...  be  defined  as  follows.  If  saturation  occurs  before 
the  Jtth  point  is  added  to  the  packed  set,  then  7\  =  oc.  Otherwise,  Tk  =  j  if  C3  is  the  Arth 
point  packed. 

Now  define  interarrival  times  L\ ,  U2.  •  •  •  by 
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and 


Vk  = 

Note  that  P{L\  =  1}  =  1  and 


Tk  -  Tk-\  if  Tk  <  oc 
oc  if  Tk  —  oc. 


P{U2  =  k}  =  pk~'(l-p)  (k  =  1,2,...), 


where 


p  =  P{Ci  A  Cj]  (i#  1). 


For  the  case  of  packing  arcs  of  length  o  on  a  circle  of  unit  circumference. 

E[U  i]  =  1, 


E[l^=— ~a  <a<1/2)’ 


and  for  a  <  1/4 


«-)"(/:  rrfc-f  r 

_21»e(ieS)  +  i 


-  4a 


(1  -2a) 
1 


1  -  4a  +  a2  +  0(a3)’ 


For  a  <  1/6 


£1^1 -(i-)  (J‘  At,)d,  +  J*  m,)d,  +  £  +  £  V>. 


where 


>.(*)  =  (1 (i 

s(l)  =  (1_4ar.fi^£  +  7^_  +  2 
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It  may  be  shown  that 

E[U4)  =  1  +  6a  +  33a2  +  0(a3). 

and  also  it  is  true  that 

1  -  6a  +  3a2  +  0(a3) 

For  k  >  5,  an  expression  for  E[i'k]  is  difficult  to  obtain.  For  fixed  k , 

however,  for  any  particular  value  a  <  [2(fc  -  l)]-1  the  approximation 

1 

1  -  2(  k  -  1  )a 

overestimates 

3.4.  The  discrete  circle 

Consider  the  sequential  random  packing  of  arcs  of  length  c  on  the  discrete  circle 
of  circumference  n.  Points  C1.C2  •••  are  uniformly  and  independently  selected  from 
5  =  {1,2,  •  •  • .  n}.  Letting 

f>  -  !/l  if|x-y|<n/2 

p(i.y)  =  •{ 

(  n  -  |x  -  y|  otherwise. 

two  arcs  with  clockwise  endpoints  C,  and  C:  will  overlap  if  and  only  if 

n(C^Cj)  <  c. 

Similarly,  the  circle  can  be  sequentially  covered  with  random  arcs  of  length  c  by  letting 
the  coverage  sets  be  given  by 

A(s)  =  {i,J©l,‘",«6(c-  1)}. 

where  the  subtraction  is  modulo  n. 

Note  that  for  c  =  1  the  circle  is  packed  if  and  only  if  the  circle  is  covered.  Thus  the 
random  variables  T  and  IF  have  the  same  distribution.  Since  the  circle  is  not  covered 
until  each  member  of  5  has  been  selected  at  least  one  time,  and  because  the  waiting  times 
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between  the  initial  occurrences  of  elements  of  S  have  geometric  distributions, 
argument  yields  that 


n  —  1 


C[W)  =  £ 

*= 0 


n 

n  —  k 


an  easy 


It  follows  that 


(3.4) 


£(U']~n  log  n  as  n  — ►  oc. 


Since  £[ir]  is  finite. 


for  the  case  of  c  —  1. 


£[ir] 

E\J] 


The  distribution  of  H\  or  equivalently  of  T.  may  be  determined  in  this  case  by  making 
use  of  the  method  of  inclusion-exclusion.  It  is  found  that 

P{\V  <  k}  =  J](-iy(")(l-;7n)‘  (*  >  n), 

j=0 

and  further,  if  n  is  not  too  small,  it  can  be  seen  that  the  approximating  expression 


PSW  <  k)  a  exp( -ne~  «  ) 


does  rather  well  (see  [14’b  The  related  case  with  selections  from  5  not  being  uniform  is 
discussed  by  Flatto  and  Newman  [18];  however,  they  do  not  obtain  the  distribution  of  U' 
exactly. 

For  c  >  2  the  distributions  of  T  and  W  are  difficult  to  obtain.  The  determination  of 
P{IF  >  k}  by  a  scheme  analogous  to  Stevens'  method  for  the  continuous  circle  becomes 
unwieldy  due  to  the  fact  that  now  two  or  more  arcs  may  exactly  coincide  with  positive 
probability.  However,  upper  and  lower  bounds  for  P{W  <  k }  and  £[1F]  may  be  obtained 
via  a  method  based  upon  an  idea  of  Cooke  [7]. 

Consider  a  circle  of  circumference  n  =  cm  for  some  integer  m,  and  let  Gj.  G 2,  •  ■  • .  Gm 
be  a  partition  of  5  given  by 

Gj  =  {(c  -  l)j  +  l.(c  -  l)j  +  2.  •  •  -  ,c;  -  l.c;}  (j  =  1,  •  •  •  ,m). 


\ 
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Note  that  if  5  is  covered  by  k  random  arcs  of  length  c.  then  each  set  in  the  partition  must 
contain  at  least  one  clockwise  endpoint  C,  ( i  £  {1,  •••,£}).  So  if  WG  denotes  the  first 
time  for  which  each  member  of  the  partition  contains  at  least  one  endpoint,  it  follows  that 
W  g  <  H’.  This  fact  and  the  principle  of  inclusion-exclusion  yield  that 

P{\V  <  k)  <  P{\VG  <  k} 


=  (m)(1  ~  J/Tr>)k 

j=0  \  J  / 


Note  that  U‘G  has  the  same  distribution  as  H  did  for  the  case  where  c  =  1  and  $  — 
Hence  it  follows  that 

£[»']  >  r[H'c] 

m  —  1 

\  ■>  77? 

m  -  k  ' 

*= o 

which  is  asymptotically  equivalent  to  m  log  m  as  m  tends  to  infinity.  Since  m  =  n/c  it 
follows  that  j  has  a  lower  bound  which  is  asymptotically  equivalent  to 


n  n 

-  log  —  as  n  — >  oc . 
c  c 


Now  consider  a  circle  of  circumferene  n  =  ( )  m  {m  £  {1.2.---}).  and  a  parti¬ 
tion  of  5  given  by 

Hj  =  {(o'  -  l)j  +  l,(c'  -  1  )j  +  2,  ••• , c'j  -  1 , c'j }  (j  =  1,  •  •  -.m) 

where  c'  =  Note  that  if  each  set  in  the  partition  contains  at  least  one  clockwise 

endpoint,  then  the  circle  is  completely  covered  by  arcs  of  length  c.  So  if  W'h  denotes  the 
first  time  for  which  each  member  of  the  partition  contains  at  least  one  endpoint,  then 


\VH  >  If.  Thus 


P{ ir  <k}>  p{\vH  <  k} 


n  \  J  / 


>  m ), 


't'&Z+Zt 
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and 


E\W]  <  E{ WHl 


m  —  1 


Em 
m  — 


k—0 


m 

m  —  k 


This  implies  that  E[W]  has  an  upper  bound  which  is  asymptotically  equivalent  to 


im 105  tm 


as  n  —  oo. 


Consider  again  the  circle  of  circumference  n  =  cm,  the  partition  G\,G 2.  •  •  • ,  and 
the  random  variable  Wq  defined  previously.  Note  that  for  a  packing  sequence  of  random 
arcs  of  length  c.  saturation  cannot  occur  until  at  least  one  point  has  been  selected  from 
every  set  in  the  partition.  Hence  \VG  <  T  and  E[T]  >  £[H’<j].  Thus  E[T ]  has  a  lower 
bound  which  is  asymptotically  equivalent  to 


as  n  — >  oc  . 


An  upper  bound  for  E\T)  can  be  obtained  as  follows.  Note  that  whenever  an  element 
s  €  5  =  { 1 ,  -  *  - ,  n}  is  selected  as  a  clockwise  endpoint  of  an  arc,  either  the  arc  will  be 
rejected  or  the  arc  will  be  added  to  the  packed  collection  of  arcs.  If  the  arc  is  rejected, 
then  subsequent  selections  of  that  element  s  will  also  lead  to  rejection.  If  the  arc  is  packed, 
then  subsequent  selections  of  the  same  element  will  lead  to  rejection.  Hence  it  is  clear  that 
as  soon  as  each  element  of  5  has  been  selected  at  least  once  then  no  further  arcs  will  be 
added  to  the  packed  collection.  Letting  IVn  denote  the  first  time  for  which  each  element 
of  5  has  been  selected  at  least  once,  it  then  follows  that  T  <  W’n.  Thus  E[T]  <  £[H'„j, 
and  so  E[T]  has  an  upper  bound  which  is  asymptotically  equivalent  to 


nlogn  as  n  — *  oc. 


Note  that  the  upper  and  lower  bounds  for  £’[11']  and  E[T]  do  not  indicate  which 
expectation  is  smaller.  Since  it  is  the  case  that  IV  <  T  for  some  outcomes  of  C].C2.  ■  •  • 
while  IV  >  T  for  others,  a  simulation  study  was  done  to  see  whether  or  not  a  relation  like 
£[!V]  <  E[T]  seems  to  hold  in  general.  2000  trials  were  performed  for  each  of  the  covering 
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cases,  and  1000  trials  were  done  for  each  of  the  packing  cases.  The  results  are  shown  in 
the  two  tables  below. 


c 

n 

ir 

T 

2 

200 

580.4425 

785.470 

2 

300 

938.2030 

1307.250 

2 

400 

1311.2655 

1829.266 

2 

600 

2079.9205 

2992.124 

2 

800 

2894.2290 

- 

r 

r? 

W 

T 

3 

300 

611.3685 

1006. 3S9 

3 

450 

979.7920 

1694.828 

3 

600 

1371.6885 

2417.914 

3 

900 

2174.7680 

4018.000 

3 

1200 

3032.0445 

— 

It.  every  case  above,  it  appears  that  E[\V]  <  E[T}\  however,  the  case  of  n  =  8 
ar.d  c  =  2  shows  that  this  is  not  true  in  general  since  there  direct  calculations  give 
F.  ’U  =  151. '15  and  E[T)  =  147/15.  It  may  also  be  seen  from  the  results  above,  that  for 
n  sufhnent'w  large.  £'U  1  appears  to  be  proportional  to 


niogf^ 


d!  :  /  7  appears  to  be  proportional  to 


"lo6(^)  • 


f«>r  the-e  observations  is  presented  below.  A  few  additional  covering  results  for 
•  \  r  -  ha\<  been  added 
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c 

71 

lf’/(  3.5 ) 

77(3.6 

2 

200 

0.509 

1.004 

2 

300 

0.512 

1.009 

2 

400 

0.512 

0.993 

2 

600 

0.510 

0.995 

2 

800 

0.510 

- 

3 

300 

0.357 

0.957 

3 

450 

0.356 

0.963 

3 

600 

0.357 

0.960 

3 

900 

0.355 

0.969 

3 

1200 

0.356 

- 

4 

400 

0.278 

- 

4 

600 

0.278 

- 

4 

800 

0.279 

- 

5 

500 

0.225 

- 

5 

750 

0.226 

- 

5 

1000 

0.225 

- 

The  interarrival  times  for  adding  new  arcs  to  the  packed  collection  are  similar  to 
those  for  the  continuous  circle,  and  they  provide  useful  anchor  points  for  checking  the 
simulation  programs.  For  the  continuous  case,  where  p  =  2a,  it  can  be  shown  that 


E{U,} 

E[U2] 

E[Uz] 

and 

E[  r«i 


i, 

i 

i-p' 

i 

1  -  2p(l  -  p/8)  +  0(p3) 

_ 1 _ 

1  -  3p(  1  -  p/4)  +  0(p3)' 


3 


4 


■e  v  v  , 
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For  the  discrete  circle  case,  where  p  =  (2c  -  1  )/n. 


£[£’i]  =  1  (n  >  c), 


E[U2)  = 


n  -  (2c  -  1)  1  -  p 


(n  >  2c), 


and  for  n  >  4(c  -  1 ) 


E(V  1  =  -  ~4c  +  3  n  ,  V-  " 

3‘  n  -  (2c  -  1)  n  -  (4c  -  2)  n  -  (2c  -  1)  "  n  -  (3c  -  1  +  j) 


For  c  =  2.  3. 4. 5  it  can  be  shown  that 


£[£3]  = 


1  -2p(l-p/8)  +  0(p3) 


as  was  true  for  the  continuous  case.  It  is  not  unreasonable  to  expect  that  this  expression 


holds  for  other  values  of  c  as  well. 


For  c  =  2  it  can  be  shown  that 


£(r4]  = 


n  —  (9  -  6/n)  +  0(n~2 ) 

_ 1 _ 

i  -  3p(i  -  Ip)  +  Oip3)' 


and  for  c  -  3  it  can  be  seen  that 


£(£4]  = 


n  -  (15  -  18/n)  +  0(n~2) 
1 


l-3p(l-  ftp)  +  Olp3) 


These  results  suggest  that 


1  —  3p(  1  —  p/4 ) 

might  approximate  £(£4]  reasonably  well  for  c  >  4  and  n  not  too  small. 


3.5.  Multidimensional  spaces 


Consider  the  metric  spaces  (Sn,pn)  (n  =  1.2.---).  where 


5n  =  {(«,.•• -,un)  :  u,  €  {1.2,3}) 


Section  3.5  Multidimensional  space>  119 


and  pn  is  the  metric  given  by 


/i«(x,y)  =  max  jx,  -  y,\. 

l<t<n 


These  simple  cubic  metric  spaces  are  equivalent  to  the  ones  described  previously  in  section 
1.8  and  section  2.5.  The  collision  criterion.  <5,  is  always  taken  to  be  2. 

The  packing  sequence  corresponds  to  letting  the  sequence  Cj ,  C2.  •  •  ■  of  points  from 
Sn  represent  the  centers  of  n-dimensional  cubes  of  sidelength  2.  The  sides  of  these 
cubes  are  aligned  with  the  sides  of  the  n-dimensional  cube  of  sidelength  4  having  ver¬ 
tices  {(uj.  •  •  • ,  tin)  :  u,  6  {0.4}}.  Since  two  random  points  C,  and  Cj  collide  if  and  only 
if  Hn(C,,Cj)  <  2.  it  follows  that  two  points  are  disjoint  only  if  their  surrounding  boxes  of 
sidelength  2  do  not  overlap. 

The  coverage  sequence  can  be  described  as  follows.  Let  UTi  denote  the  n-dimensional 
cube  of  edgelength  4  which  contains  the  3n  elements  of  5„  in  its  interior.  Then  letting 
A{CX )  denote  the  box  of  sidelength  2  centered  on  C,.  the  covering  time  U'  is  the  smallest 


integer  xv  for  which 


Un  =  Q^(C\). 


Note  that  this  definition  of  If  differs  slightly  from  the  one  given  in  section  3.1.  because 
the  space  being  covered.  Un,  is  not  the  same  as  5n. 

Let  S'n  denote  the  set  containing  the  2"  corner  sites  in  Un.  Thus 

S'n  =  {(1/1, •••,«„)  :  u,  G  {1.3}}. 

Note  that  Un  is  covered  if  and  only  if  each  member  of  S'n  has  been  selected  at  least  one 
time.  Hence  it  follows  that 

m]-t^ 

~  3n  log  2n  as  n  — >  oc. 

Now  let  A/n  denote  the  number  of  points  in  a  saturated  packing.  Note  that  just 
before  the  A/nth  point  was  packed,  there  was  at  least  one  site  available  in  which  a  point 
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could  be  packed.  Similarly,  just  before  the  (A/n  -  l)th  point  was  packed  there  were  at 
least  two  a\'ailable  sites,  and  in  general  there  are  k  +  1  or  more  available  sites  just  previous 
to  the  packing  of  the  ( Mn  -  &)th  point. 


For  0  <  k  <  Mn  -  1,  let  Z„  be  the  number  of  trials  required  to  place  the  (Mn  -  Jt)th 
packed  point,  and  for  Mn  <  k  <  2"  let  Z„  equal  It,  follows  from  comments  above 

that 


E[Zk]  < 


3" 

(fc+1)' 


Noting  that  1  <  Mn  <  2n,  it  also  follows  that 


2n  —  1 


r  - 


and  so 


k=  0 


2"  — 1 


E[T]  <  Y,  E\Zk) 

k= 0 
2n  — 1 

*  1  on 


=  E[W}. 

It  is  also  possible  to  show  that  E[T]  <  £"[H']  and  T  <  almost  surely.  A  thorough  proof 
of  this  last  fact  is  rather  tedious,  but  not  difficult. 


For  1  <  n  <  5  £"[11 ']  was  calculated  exactly.  For  1  <  n  <  2,  E[T]  was  calculated 
exactly,  and  for  3  <  n  <  5  E[T ]  was  estimated  using  simulation  results.  For  3  <  n  <4. 
50,000  trials  were  done  in  each  case,  and  10,000  trials  were  performed  for  the  case  n  =  5. 
These  results  are  presented  below. 


n 

E[W] 

E\T) 

E[W]/E[T) 

1 

4.5 

3 

1.5 

2 

18.75 

10.2 

1.84 

3 

73.38 

37.60 

1.95 

4 

273.8 

142.3 

1.92 

5 

986.2 

541.2 

1.82 
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Cooke  [7]  has  considered  sequential  coverage  sequences  for  other  multidimensional 
spaces.  From  Miles'  results  dealing  with  Poisson  point  processes  (see  [40]  and  [41]).  Cooke 
arrives  at  expressions  of  the  form 

~  7i  P  log  72  P  as  p[  0, 


where  p  is  the  probability  that  two  random  points  collide  and  71  and  are  constants. 
It  should  be  noted  that  Cooke  does  not  offer  direct  proofs  of  some  of  his  results,  he  only 
claims  that  they  are  suggested  by  the  work  of  Miles. 

For  the  case  of  covering  the  unit  two-dimensional  torus  with  random  disks  of  radius 
a,  it  follows  form  one  of  Cooke’s  claims  that 


(3.7) 


for  a  not  too  large.  The  accuracy  of  this  approximation  may  be  checked  by  comparing 
(3.7)  with  simulation  results  for  various  choices  of  a.  In  the  simulation  runs,  coverage  of 
the  torus  was  checked  by  making  sure  that  each  crossing  was  covered.  See  [54]  for  details 
of  this  method  in  a  similar  setting. 


a 

no.  trials 

observed 

(3.7) 

0.25 

20 

23.00 

14.12 

0.20 

20 

47.80 

25.62 

0.15 

20 

97.15 

53.68 

0.10 

10 

251.6 

146.6 

The  results  above  indicate  that  Cooke’s  formula  does  not  provide  very  good  approxi¬ 
mations.  Thus  it  is  probably  not  appropriate  to  make  quick  substitutions  in  results  derived 
for  a  homogeneous  planar  Poisson  point  process  as  Cooke  did.  The  work  of  Domb  [1 1] 
also  indicates  that  it  is  not  trivial  to  obtain  sequential  coverage  results  from  those  derived 
for  a  Poisson  point  process. 

£[IT]  can  be  determined  exactly  in  the  case  of  covering  the  surface  of  a  sphere  with 
random  hemispheres.  5  is  taken  to  be  the  surface  of  a  sphere  having  unit  radius,  and 
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Cj ,  C2  ■  ■  ■  are  the  centers  of  spherical  caps  of  half  angle  radius  j.  For  this  setting  it 
follows  as  a  special  case  of  a  result  of  Wendel  [59]  that 

P{W  >  Jfc}  =  2~k{k2  -  k  +  2)  (Jfc>l). 


Hence 

OO 

£[li']  =  1  +  + 

fc=i 

For  the  related  lower  dimensional  problem  of  covering  a  circle  with  hemispheres,  it  follows 
from  Wendel’s  result  that 

*t»1  ■ 1  + 1  (jct) 

*=1  v  ' 

=  5. 

The  corresponding  packing  problems  are  trivial  for  both  of  these  cases. 


Now  consider  covering  the  surface  of  a  sphere  in  3-space  with  spherical  caps  of  half 
angle  radius  a  <  ^.  The  results  of  a  simulation  study  (see  [54])  indicate  that  P{ W  >  /:}  is 
closely  approximated  by  a  formula  which  is  a  special  case  of  a  result  of  Miles  [40].  Letting 
M(o)  denote  the  smallest  number  of  caps  for  which  a  covering  is  possible,  an  improved 
approximation  is 


P{\V  >  k)  ft 


1  for  k  <  M(a) 

2 ~kk(k  -  l)sin2  q(1  +  cos a)fc-2  for  k  >  M(a). 


Hence  it  follows  that 


£[ir]*Jl/(o)+51^  £  *(*-  l)(si«2f) 


k-7 


k=M(o) 


Mia) 


s>n2  Q  x.,  .  (  ■  2  Q\ M(o)-2 

1  +  — - — A/(q)  (sin2  -) 


as  q  1  0. 


It  should  be  noted  that  M(a)  is  not  easy  to  determine  in  general. 


The  distributions  of  T  for  packing  sequences  on  spherical  surfaces  and  tori  are  difficult 
to  determine.  For  the  packing  of  n-dimensional  cubes  in  an  n-dimensiona!  torus,  a  straight¬ 
forward  extension  of  the  argument  for  the  one-dimensional  case  yields  that  E[T]  =  oc 
provided  that  the  cubes  are  not  too  large. 
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3.6.  Summary 

Although  covering  problems  are  abundant  in  the  literature,  sequential  covering  results 
concerning  £[W]  have  been  considered  by  only  a  few  authors,  including  Flatto  and  Cooke. 
Similarly,  results  concerning  T  haven't  been  previously  developed  even  though  numerous 
investigators  have  studied  random  packing  densities. 

Among  the  results  in  this  chapter,  it  has  been  shown  that  the  ratio  £[U']/£[7"]  can 
be  equal  to  zero,  less  than  one.  equal  to  one,  or  greater  than  one.  The  ratio  is  equal  to 
zero  for  the  continuous  circle  since  E  T]  =  oc.  For  the  discrete  circle,  the  ratio  equals  one 
if  c  =  1,  and  is  less  than  one  if  c  >  2  and  n  is  not  too  small.  The  ratio  exceeds  one  in  the 
simple  cubic  case  discussed  in  the  previous  section. 
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Appendix  A:  Packing  by  Hamming  Distance 

M  is  the  number  of  words  in  a  saturated  packing  of  9-ary  codewords  of  length  n 
having  minimum  allowable  interword  distance  d. 
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10 

4 

3 

2 

13.390000 

1.283221 

4000 

4  4  2 
4  4  3 


46.608000  4.988666  3000 

11.035692  0.947948  3250 


Appendix  135 


5 

2 

165.513333 

19.055703 

1200 

5 

3 

30.464400 

2.446511 

2500 

5 

4 

8.316000 

0.431621 

3000 

6 

2 

597.522730 

87.982516 

220 

6 

3 

91.980800 

5.232618 

1250 

6 

4 

22.189091 

1.263571 

1100 

t 

2 

2172.600000 

527.410526 

20 

7 

3 

293.000000 

18.621622 

75 

7 

4 

62.960000 

3.412525 

100 

3 

2 

20.997500 

1.506247 

2000 

4 

2 

91.401000 

7.225424 

1000 

4 

3 

17.402000 

0.945341 

1000 

5 

2 

405.015000 

47.411839 

200 

5 

3 

62.226000 

4.788501 

500 

5 

4 

14.246000 

1.272757 

1000 

6 

2 

1823.800000 

187.536842 

20 

6 

3 

233.180000 

16.803673 

50 

6 

4 

43.940000 

2.056970 

100 
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Appendix  B:  Packing  by  Lee  Distance 


M  denotes  the  number  of  words  in  a  saturated  packing  of  9-arv  codewords  of  length 
n  with  minimum  allowable  interword  distance  d. 


9 

n 

d 

E(\f) 

Var(AI) 

#  trials 

4 

3 

2 

20.074583 

13.625545 

2400 

4 

4 

2 

68.965417 

54.404343 

2400 

4 

5 

2 

242.035000 

147.532607 

400 

4 

6 

2 

869.071429 

542.006763 

140 

4 

1 

2 

3165.300000 

2106.900000 

10 

4 

4 

3 

16.467500 

0.763686 

4800 

4 

5 

3 

49.406250 

3.092577 

800 

4 

6 

3 

155.525000 

10.228763 

2S0 

20 


4  7  3  502.900000  18.936842 
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5 

40 

0.04902 

1000 

5 

50 

0.04894 

500 

5 

60 

0.04917 

200 

5 

70 

0.04900 

200 

5 

75 

0.04915 

200 

5 

80 

0.04896 

100 

5 

85 

0.04898 

100 

5 

90 

0.04897 

100 

5 

95 

0.04908 

100 

6 

50 

0.03416 

1000 

6 

60 

0.03415 

500 

6 

70 

0.03418 

200 

6 

80 

0.03415 

200 

6 

90 

0.03417 

100 

6 

95 

0.0340? 

100 

7 

60 

0.02448 

100 

7 

70 

0.02453 

100 

i 

80 

0.02451 

100 

7 

90 

0.02456 

100 

7 

100 

0.02452 

100 

8 

80 

0.01885 

100 

8 

90 

0.01881 

100 

8 

100 

0.01885 

100 

8 

110 

0.01882 

100 

8 

120 

0.01879 

100 

9 

100 

0.01467 

100 

9 

105 

0.0146$ 

100 

9 

110 

0.01466 

100 

9 

115 

0.01471 

100 

9 

120 

0.01462 

100 

9 

125 

0.01465 

100 

9 

130 

0.01467 

200 

10 

115 

0 01 l S3 

200 

10 

120 

0.01189 

200 

10 

125 

0.01186 

200 

10 

12« 

0.0118$ 

200 

10 

130 

0.01186 

200 

10 

132 

0.01 187 

200 

11 

120 

0.009722 

200 

11 

125 

0.009705 

200 

11 

12$ 

0.009723 

200 

11 

130 

0.009733 

200 

11 

132 

0.00972$ 

200 

12 

125 

0.008162 

100 

12 

126 

0.008154 

100 

12 

127 

0.008188 

100 

12 

12$ 

0.008165 

100 

12 

129 

0.008204 

100 

12 

130 

0.008178 

100 

12 

131 

0.008161 

100 

12 

132 

0.008194 

100 
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Appendix  C:  Packing  Square  Boxes 


p  is  the  average  proportion  of  a  q  by  q  torus  shaped  lattice  covered  by  a  saturated 
packing  of  d  by  d  blocks. 


9 

P 

#  trials 

2 

23 

0.7471 

100 

2 

26 

0.7433 

100 

2 

29 

0.7482 

100 

2 

32 

0.7496 

100 

2 

35 

0.7470 

100 

.1 

35 

0.6786 

100 

3 

40 

0.6807 

100 

3 

45 

0.6785 

100 

3 

50 

0.6817 

100 

3 

55 

0.6774 

100 

4 

60 

0.6488 

100 

4 

65 

0.6498 

100 

4 

70 

0.6484 

100 

4 

75 

0.6479 

100 

4 

80 

0.6467 

100 

5 

80 

0.6328 

100 

5 

90 

0.6304 

100 

5 

100 

0.6280 

100 

5 

110 

0.6284 

100 

5 

120 

0.6307 

100 

/  • 
■C.' 


% 
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Appendix  D:  Complementary  Codes 


n  is  the  word  length 

A(n.d)  and  K.  are  defined  in  Section  2.6 

max  is  the  number  of  words  in  the  largest  randomly  formed  code 
min  is  the  number  of  words  in  the  smallest  randomly  formed  code 
mean  is  the  average  number  of  words  randomly  packed 
f  is  the  observed  center  density 

r*  is  the  observed  center  density  for  the  corresponding  case  where  the  packing  was 
done  by  the  method  of  Section  2.2 


n 

.4 1  (i .  d ) 

max 

min 

mean 

r 

r* 

4 

2 

8 

8 

s 

8.000 

0.500 

0.388 

5 

2.3 

16 

10 

8 

9.344 

0.292 

0.346 

6 

3 

8 

4 

4 

4.000 

0.063 

0.097 

6 

2.4 

32 

32 

32 

32.000 

0.500 

0.315 

6 

2.3.4 

32 

32 

12 

21.868 

0.342 

0.315 

7 

3.4 

16 

16 

16 

16.000 

0.125 

0.078 

i 

2.5 

64 

14 

8 

12.805 

0.100 

0.290 

i 

2. 3.4. 5 

64 

44 

16 

35.875 

0.280 

0.290 

8 

4 

16 

16 

16 

16.000 

0.063 

0.032 

8 

3.5 

20 

4 

4 

4.000 

0.016 

0.064 

8 

3.4,5 

20 

16 

16 

16.000 

0.063 

0.064 

8 

2,6 

128 

16 

8 

14.616 

0.057 

0.269 

8 

2,4.6 

128 

128 

128 

128.000 

0.500 

0.269 

8 

2.3, 5.6 

128 

26 

16 

23.784 

0.093 

0.269 

8 

2, 3,4, 5,6 

128 

128 

52 

69.631 

0.272 

0.269 
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9 

4.5 

20 

16 

16 

16.000 

0.031 

9 

3.6 

40 

8 

8 

8.000 

0.016 

9 

3,4. 5.6 

40 

32 

20 

24.995 

0.049 

9 

2,7 

256 

18 

8 

16.522 

0.032 

9 

2.3.6. 7 

256 

18 

8 

17.006 

0.033 

9 

2.4. 5. 7 

256 

74 

32 

50.684 

0.099 

9 

2, 3, 4.5.6, 7 

256 

158 

98 

128.318 

0.251 

10 

5 

12 

4 

4 

4.000 

0.004 

10 

4.6 

40 

32 

20 

24.998 

0.024 

10 

4.5.6 

40 

2S 

4 

9.620 

0.009 

10 

3.7 

[72.79] 

4 

4 

4.000 

0.004 

10 

3,5.7 

[72.79] 

4 

4 

4.000 

0.004 

10 

3.4.6. 7 

[72.79] 

32 

16 

25.475 

0.025 

10 

3, 4, 5, 6. 7 

[72.79] 

52 

42 

47.260 

0.046 

10 

2.8 

512 

20 

8 

18.457 

0.018 

10 

2.3. 7. 8 

512 

20 

8 

18.234 

0.018 

10 

2.5.8 

512 

20 

8 

17.540 

0.017 

10 

2.4.6.S 

512 

512 

512 

512.000 

0.500 

10 

2.3.5. 7. 8 

512 

40 

16 

35.031 

0.034 

10 

2.3.4.6.7.S 

512 

512 

32 

316.110 

0.309 

10 

2.3. 5.6.* 

512 

512 

36 

316.583 

0.309 

10 

2.3.4.5.C.7.8 

512 

348 

212 

240.520 

0.235 

0.026 

0.055 

0.055 

0.252 

0.252 

0.252 

0.252 

0.007 

0.021 

0.021 

0.048 

0.048 

0.04S 

0.048 

0.237 

0.237 

0.237 

0.237 

0.237 

0.237 

0.237 

0.237 
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11 

5,6 

24 

24 

8 

23.760 

0.012 

0.006 

11 

4,7 

[72,79] 

16 

10 

15.784 

0.008 

0.017 

11 

4, 5, 6, 7 

[72,79] 

32 

24 

27.620 

0.013 

0.017 

11 

3,8 

[144,158] 

4 

4 

4.000 

0.002 

0.043 

11 

3.4, 7.S 

[144.158] 

32 

32 

32.000 

0.016 

0.043 

11 

3, 5,6.8 

[144.158] 

18 

12 

15.280 

0.007 

0.043 

11 

3. 4, 5, 6, 7,8 

[144,158] 

104 

84 

92.060 

0.045 

0.043 

11 

2.9 

1024 

22 

8 

20.426 

0.010 

0.223 

11 

2.3.S.9 

1024 

22 

8 

20.064 

0.010 

0.223 

11 

2.4, 7, 9 

1024 

112 

32 

73.250 

0.036 

0.223 

11 

2.5.6 .9 

1024 

24 

8 

20.921 

0.010 

0.223 

11 

2.3,4. 7.S.9 

1024 

112 

16 

42.261 

0.021 

0.223 

11 

2.3.5.6.S.9 

1024 

58 

22 

35.415 

0.017 

0.223 

11 

2.4, 5,6. 7.9 

1024 

314 

92 

189.442 

0.093 

0.223 

11 

2.3.4.0.6, 7.8.9 

1024 

530 

426 

460.100 

0.225 

0.223 

12 

6 

24 

24 

8 

23.680 

0.006 

0.002 

12 

5.7 

32 

4 

4 

4.000 

0.001 

0.004 

12 

5.6,7 

32 

24 

18 

21.810 

0.005 

0.004 

12 

4.8 

[144.158] 

32 

32 

32.000 

0.008 

0.014 

12 

4.5. 7.8 

[144.158] 

64 

20 

34.020 

0.008 

0.014 

12 

4,6.8 

[144,158] 

104 

80 

91.950 

0.022 

0.014 

12 

4.5.6. 7.8 

[144.158] 

96 

40 

62.000 

0.015 

0.014 
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12 

3.9 

256 

4 

4 

4.000 

0.001 

0.038 

12 

3.4,8. 9 

256 

32 

12 

25.120 

0.006 

0.038 

12 

3,6.9 

256 

24 

16 

20.520 

0.005 

0.038 

12 

3.5, i  ,9 

256 

4 

4 

4.000 

0.001 

0.038 

12 

3.4. 6.8. 9 

256 

106 

12 

78.620 

0.019 

0.038 

12 

3,4.5, 7.8.9 

256 

64 

44 

58.606 

0.014 

0.038 

12 

3. 5. 6. 7. 9 

256 

48 

30 

45.090 

0.011 

0.038 

12 

3, 4. 5. 6. 7,8, 9 

256 

168 

146 

157.632 

0.038 

0.038 

12 

2.10 

2048 

24 

8 

22.384 

0.005 

0.212 

12 

2.3.9,10 

2048 

24 

8 

22.060 

0.005 

0.212 

12 

2.6.10 

204S 

24 

16 

22.440 

0.005 

0.212 

12 

2,4.8.10 

2048 

134 

20 

5S.040 

0.014 

0.212 

12 

2.3.6.9.10 

2048 

26 

8 

22.544 

0.006 

0.212 

12 

2.3.4.8.9.10 

2048 

134 

16 

57.173 

0.014 

0.212 

12 

2.5.7.10 

2048 

38 

16 

34.690 

0.008 

0.212 

12 

2.4.6.8.10 

2048 

2048 

2048 

2048.000 

0.500 

0.212 

12 

2.3.5.7,9.10 

2048 

48 

16 

43.473 

0.011 

0.212 

12 

2.3.4.6.8.9,10 

2048 

2048 

30 

1683.946 

0.411 

0.212 

12 

2.4.5.7.8.10 

2048 

116 

30 

59.549 

0.015 

0.212 

12 

2.5.6,7.10 

2048 

36 

20 

27.690 

0.007 

0.212 

12 

2.3.4.5,7.8.9.10 

2048 

240 

46 

110.170 

0.027 

0.212 

12 

2.3.5.6,7.9.10 

2048 

48 

32 

43.940 

0.011 

0.212 

12 

2.4.5.6.7.8.10 

2048 

2048 

128 

823.857 

0.201 

0.212 

12 

2.3.4.5.6,7,8.2.10 

2048 

1120 

796 

867.528 

0.212 

0.212 

Joan 
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Densest  packing  schemes 


n 

d 

£  -  {n} 

f 

4 

2 

2 

0.500 

5 

2 

2.3 

0.292 

6 

3 

3 

0.063 

6 

2 

2,4 

0.500 

7 

3 

3,4 

0.125 

7 

2 

2, 3, 4, 5 

0.280 

8 

4 

4 

0.063 

8 

3 

3,4,5 

0.063 

8 

2 

2,4,6 

0.500 

9 

4 

4.5 

0.031 

9 

3 

3, 4, 5, 6 

0.049 

9 

2 

2, 3.4, 5, 6, 7 

0.251 

10 

5 

5 

0.004 

10 

4 

4,6 

0.024 

10 

3 

3.4, 5,6. 7 

0.046 

10 

2 

2, 4, 6,8 

0.500 

11 

5 

5,6 

0.012 

11 

4 

4.5, 6, 7 

0.013 

11 

3 

3, 4, 5, 6, 7. 8 

0.045 

11 

2  2, 3, 4, 5, 6.7, 8.9 

0.225 

12 

6 

6 

0.006 

12 

5 

5,6,7 

0.005 

12 

4 

4,6,8 

0.022 

12 

3 

3, 4, 5, 6, 7.8, 9 

0.038 

12 

2 

2,4,6,8,10 

0.500 

0.388 

0.346 

0.097 

0.315 

0.078 

0.290 

0.032 

0.064 

0.269 

0.026 

0.055 

0.252 

0.007 

0.021 

0.048 

0.237 

0.006 

0.017 

0.043 

0.223 

0.002 

0.004 

0.014 

0.038 

0.212 
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